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EDITOR’S PREFACE 


The Yearbook on Nature and Nurture has grown out of a sug- 
gestion made by Dr. L. M. Terman in December, 1923, that the 
Society might profitably initiate some research on the limitations 
of educability, might help to answer the question whether edu- 
cational effort can or can not make bright children out of dull 
ones. The Executive Committee of the Society, at a meeting held 
at Detroit, January, 1924, acted favorably upon the suggestion 
and requested Dr. Terman, in collaboration with Dr. Bagley, to 
select a suitable committee to undertake the work. In April, 1924, 
the plans were formally endorsed and a Committee appointed to 
produce a Yearbook on the ‘‘ Possibilities and Limitations of Train- 
ing.’? Further development of the plans has been set before the 
Society already (for example, in the 24th Yearbook, Part II, pp. 
386-8, and in the 25th Yearbook, Part II, pp. 254-6). The change 
of title of the Yearbook itself does not, of course, involve any essen- 
tial change in the subject matter dealt with by the Committee dur- 
ing the four years the work has been under way. 

The Board of Directors appropriated for the expenses of this 
Yearbook Committee $700 in April, 1924, $600 in October, 1925, 
and a special fund of $500 in February, 1927. These sums, how- 
ever, represent only a fraction of the money expended in procuring 
the data that follow. Four of the major contributions—the Chi- 
cago and Stanford investigations of foster children (Chapters IX 
and X of Part I), the Hollingworth-Cobb investigation (Chapter 
I of Part II), and the Heilman investigation (Chapter II of Part 
II)—have been facilitated by grants from the Commonwealth 
Fund, from Mr. Max Rosenberg, from Stanford University, from 
the Institute of Educational Research of Teachers College, from the 
Carnegie Corporation of New York, from the Colorado State Teach- 
ers College, and from the Denver Public Schools. The Society 
expresses its appreciation of these contributions; explicit acknowl- 
edgment of them is made elsewhere in these volumes. Many other 
organizations and institutions have also assisted us, especially 
through permitting their representatives to devote their time and 
energy to the gathering of data and preparation of contributions. 
If these contributions and the personal efforts of the numerous 
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persons concerned could be computed also in terms of dollars and 
cents, the investment of capital in the production of the Twenty- 
Seventh Yearbook would indeed be impressive. The book presents 
a splendid example of the kind of codperative research which the 
Board of Directors has aimed to develop through its policy of 
establishing committees of experts, supplying them directly or in- 
directly with adequate funds, over a period of several years, and 
providing a suitable medium for the publication and dissemination 
of their reports. 

The fundamental importance of the topic under discussion in 
this Yearbook is clearly set forth in Dr. Terman’s introductory 
chapter. Nothing more need be said on that point, but it may 
be well here to warn the reader that the problem is not only im- 
portant, but also difficult. It has been impossible in this volume 
to preserve the relatively simple style of presentation that has 
characterized many of our yearbooks which were designed to be 
directly helpful to the classroom teacher. It has been necessary in 
the present series of studies to employ intricate methods of ap- 
proach and complex statistical methods of treatment. That could 
not be avoided, but the committee has sought to assist the lay 
reader by prefacing each chapter with a brief orientation, ex- 
plaining the purpose, method, and outcome of the study in ques- 
tion and its relation to the general problem of the Yearbook. 

Those who think of the nature-nurture issue as a fruitful field 
for rhetorical debate will be disappointed in the pages that follow. 
No one has tried to be controversial. Every contributor has sought 
to be straightforward and objective. The aim in this Yearbook 
has not been to compare opinions (however helpful that may have 
been in dealing with other educational issues), but to determine 
facts. 

With these explanations in mind, the editor trusts the mem- 
bers of the Society will approve of the content and style of the 
Twenty-Seventh Yearbook, despite its technical vein and the some- 
what abstruse character of many of the contributions. 


G. M. WHIPPLE 


GENERAL INTRODUCTION TO 
THE 1928 YEARBOOK 


Lewis M. TERMAN 
Stanford University 


Chairman of the Yearbook Committee 

Background of the 1928 Yearbook. For several years a lively 
controversy has waged among school people and psychologists with 
reference to the causes and significance of the large individual dif- 
ferences which intelligence and achievement tests have disclosed. 
The issues involved are obviously of great consequence for educa- 
tional theory and practice, since they are part and parcel of the 
age-old problem as to the relative influence of nature and nurture 
upon human destinies. Ifthe differences found are due in the main 
to controllable factors of environment and training, then, theoret- 
ically, at least, they can be wiped out by appropriate educational 
procedures—procedures which it would then become our duty to 
provide. On the other hand, if they are due primarily to differ- 
ences in original endowment, then the duty of the school is clearly 
to provide for differentiated training which will take these native 
differences into account. 

Now any proposal to apply in practice the doctrine of the 
biological inequality of human beings, whether in politics or edu- 
cation, impinges upon some of our deepest-lying prejudices and 
challenges our fundamental philosophies of life. It was only nat- 
ural, therefore, that all too frequently the controversy should have 
descended from the level of calm discussion of the pertinent scien- 
tifie evidence to that of acrimonious asseveration and emotional 
appeal. Rightly or wrongly, some have felt that educational democ- 
racy is at stake, and any threat, fancied or real, against the demo- 
cratic ideal of public education arouses the same antagonism as a 
threat to the democratic principle of political control. Catch 
phrases, like ‘‘educational determinism’’ or ‘‘democracy and the 
1.Q.,’’ have become as charged with emotion as were once such 
slogans as ‘‘states rights,’’ ‘‘abolition,’’ or ‘‘taxation without rep- 
resentation.’’ The belief in native differences and in the validity 
of intelligence tests as a method of measuring them is looked upon 
by some of the environmentalists as belonging to the same category 
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as the theological doctrine of infant damnation. Unfortunately, 
the dogma of certain of the environmentalists has too often been 
answered by the dogma of hereditarians. Too many on both sides 
have shown more interest in finding and phrasing arguments to 
support a theory than in investigating the problem by the methods 
of science. 

Not that there has not been ample reason in the past why 
speculations rather than facts have tinged our thinking upon this 
question. As a scientific problem, this one is exceedingly young. 
Until Galton (without proving whether the fact was due to nature 
or to nurture) showed, when he published his Hereditary Genius 
in 1869, that high ability has a strong tendency to cluster in family 
lines, no one had ever made an extensive investigation in this field. 
Although Hereditary Genius was the stimulus for a few lesser 
studies which followed it closely in time, it was not until the be- 
ginning of the twentieth century that concerted resources were de- 
voted to unraveling the problem. Even then the early attempts 
were of necessity tentative and incomplete, partly because the prob- 
lem itself was as yet merely sensed rather than clearly formulated, 
partly because a large amount of preliminary exploratory work was 
essential before the lines of attack most likely to be fruitful and 
crucial could be appreciated, and partly because refined methods 
of measuring intellectual abilities were not yet available. 

Possibly one misconception (now fortunately clarified by recent 
contributors to the nature-nurture field) has clouded the pertinent 
issues as much as any single other. This is the notion that a 
‘nature’ theory implies a type of ‘glass bottle’ mental development 
which is quite independent of any stimulation from environment. 
No idea could be more misleading, and no phenomenon more im- 
possible. The theoretical as well as the practical nature-nurture 
problem that faces us deals with human beings as they are—not as 
an experimenter might rear them in a nurture-proof laboratory. 
We are interested in the child or adult as he comes to us—with his 
unique complex of ancestry, associates, home training, schooling, 
and physical and moral attributes. We are interested, not in find- 
ing out how he would have developed if he had had no environment 
at all; rather we wish to discover whether or not he can be made « 
a more intelligent individual om a more learned one by improving 
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the conditions of his miliew within the limits found in reasonably 
good social communities. More generally, we wish to find the rela- 
tive potency of all types of human environment to add to, or to 
detract from, human endowment, and to know the limits placed 
upon achievement by endowment. 

Inception of the Present Yearbook Project. The situation set 
forth in the preceding paragraphs was discussed in the February, 
1924, meeting of the Board of Directors of the National Society for 
the Study of Education, and following this discussion an agreement 
was reached to devote an early yearbook to the topic, ‘‘The Possi- 
bilities and Limitations of Training.’’ Shortly afterward I was 
invited by the Board of Directors to form a committee for the 
preparation of a yearbook devoted to this subject. I agreed to do 
so on condition that the project be made a research undertaking 
rather than a spectacular combat of educational gladiators. In 
view of the large amount of heated discussion that had taken place, 
it seemed best to devote this yearbook almost entirely to the dis- 
passionate exposition of the results of new investigations bearing 
on the nature-nurture problem, with only a minimum of space de- 
voted to orientational and interpretative contributions. 

A committee was at once formed which was favorable to such 
a program. The Committee was selected so as to represent all shades 
of opinion on the.issues involved, for it seemed very important to 
avoid the suspicion of making the Yearbook the special mouthpiece 
of either side to the controversy. Whatever faults may be found 
with the outcome of the Committee’s work, I think it can be said 
that all of its members have shown throughout an objective-minded, 
investigational attitude. The sole concern of the Committee has 
been to bring together all the new evidence it was possible to secure 
on either side of the question. This method of approach once 
agreed upon, one finds that it is possible for persons of the most 
antagonistic beliefs to cooperate effectively in a program of research. 

Activities of the Committee. The Committee has had only two 
meetings, one in Chicago in October, 1925, the other in Philadelphia 
in December, 1926. The Chicago meeting was fully attended and 
occupied an entire day. The Philadelphia meeting was briefer 
and could be attended only by Baldwin, Brigham, Terman, and 
Whipple. Fortunately, the Committee chairman (Terman) and 
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the Yearbook editor (Whipple) were able to hold a series of con- 
ferences at Stanford University in May and June, 1927, and to 
agree upon various matters with reference to the selection and 
arrangement of material. 

Previous to the first meeting of the committee, the general plan 
of the Yearbook had been determined upon by correspondence. This 
plan included the following specifications: 

1. The Yearbook should be devoted to the nature-nurture prob- 
lem, with special reference to the relative influence of these factors 
upon scores in intelligence and achievement tests. 

2. The major part of the space should go to the exposition of 
new investigational results. 

3. To this end, effort should be made to secure the codperation 
of as many scientific workers as possible in the conduct of researches 
bearing on various aspects of the problem. 

4. Investigations tending to emphasize either the possibilities 
or the limitations of training were to be equally welcomed. 

5. Contributors would be expected to confine themselves pri- 
marily to the exposition of factual data and to the brief statement 
of their conclusions based thereon. 

In October, 1925, an appeal for codperation was mailed to 
nearly two hundred persons throughout the country, and this was 
followed by two published appeals, one in the Journal of Educa- 
tional Research, May, 1926, the other in School and Society, March, 
27, 1926. Systematic personal enquiries were made for the purpose 
of getting in touch with investigations already under way which 
might have a bearing on the Yearbook’s problem. Other investi- 
gators were urged to undertake special researches for the purpose. 
The two large and meaty volumes here offered are evidence enough 
of the response secured. The table of contents for the two volumes 
shows a wide variety in the special phases of the problem which 
were attacked. The part devoted to factors influencing intelligence- 
test scores covers such broad and basic topies as ‘‘home environ- 
ment,’’ and such special ones as ‘‘coaching.’’ The part dealing with 
achievement test scores presents critical data upon a number of 
factors, such as school expenditures and school attendance, whose 
efficacy to produce educational results we have always taken for 
granted in the past. The section on special mental traits offers upon 
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several important problems material which is virtually pioneer 
work. It would not be easy to estimate the total labor cost which 
the investigations of these two volumes represent. In all proba- 
bility it would bulk to more than $100,000 if bought and paid for 
in the market. Several of the studies would not have been made 
at this time, had it not been for the stimulus offered by the Year- 
book project. 

Acknowledgments. In behalf of the Committee I wish to thank 
all those who have contributed so liberally of their time and effort 
to make this Yearbook a success. In many cases the contributions 
have been made at considerable personal sacrifice. The Committee 
appreciates not only the contributions as such, but also the non- 
partisan and scientific spirit in which they have been made. There 
has been little or no rhetorical straining for effect in the exposi- 
tion or interpretation of results. Few treatments of the nature- 
nurture problem have consistently maintained so high a level of 
open-mindedness and intellectual honesty. 

Grateful acknowledgment is made to the Commonwealth Fund 
for a grant of $15,000, divided equally between Chicago and Stan- 
ford Universities, in support of the foster children studies. The 
Stanford grant was supplemented by a contribution of approxi- 
mately $2,500 from the Thomas Welton Stanford Fund for 
Psychological Research and by a gift of $1200 from Mr. Max Rosen- 
berg, of San Francisco. The State Teachers College of Greeley, 
Colorado, and the publie school officials of Denver codperated help- 
fully in making possible the extensive and laborious investigation 
by Dr. Heilman. Many other educational institutions and many 
individuals, not all of whom it is possible to mention by name, have 
contributed in important ways to the carrying out of the investi- 
gations herein reported. 

Special acknowledgement is made of the invaluable assistance 
rendered by Barbara Stoddard Burks, now Mrs. Herman Rams- 
perger, of Stanford University. Besides carrying out the Stanford 
foster-children investigation and preparing the masterly summary 
of nature-nurture literature, Miss Burks served throughout the 
period of the Yearbook’s preparation as my research and secretarial 
assistant. In this capacity she not only conducted the major part © 
of the routine correspondence, but in addition gave valuable as- 
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sistance to several contributors, both in planning their researches 
and in reporting their results. Following her suggestions, several 
chapters of the Yearbook were largely rewritten and many sta- 
tistical procedures were modified. Probably no other single in- 
dividual deserves more credit for whatever merits the Yearbook 
possesses. 


Suggestions for Use of the Yearbook. The amount of real ser- 
vice which these volumes will render to educational theory and 
practice will depend in no small degree upon the spirit in which 
their content is approached. In the first place, readers who do not 
sufficiently appreciate the inherent difficulties and complexities of 
the problem, particularly the difficulty of securing factual data 
that are at once crucial and unambiguous, will perhaps be dis- 
appointed in the lack of perfect consistency in the results of various 
contributors. For certainly it must be admitted that no final an- 
swer to the nature-nurture question has been attained or even ap- 
proximated. The most that can justly be claimed is that the 
bounds of our knowledge have been in some measure extended. 
Anyone who has reasonable regard for the fundamental import- 
ance of the problem will be grateful for any advance in our know]l- 
edge, however slight, which the contributions of this Yearbook have 
accomplished. It is conceivable that the elusive nature of the prob- 
lem is such as to preclude for a long time to come, if not forever, 
a complete and final solution. One needs to cultivate patience and 
the faculty of suspending judgment. 

Secondly, in his study of the investigations reported the reader 
is urged to divest himself, in so far as it is humanly possible, of 
his emotional bias and predilections. Because of unconscious fact- 
ors that are so likely to play a réle, one can not always be sure 
whether one has or has not attained to a state of complete intel- 
lectual detachment, but the effort should nevertheless be made. 
How absurd, after all, is the attitude of mind which would force 
truth into any kind of preconceived mold, when only truth that 
is genuine and undistorted can give us the control over human 
nature and human destinies that our various institutions, includ- 
ing the school, are intended to exercise! Surely, anyone who is 
imbued at all with the spirit of modern science should be less con- 
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cerned about what is true than about knowing what 1s true. In 
order to counteract the inevitable human tendency to bias, the 
reader will do well to make a conscientious effort to find merit in 
those investigations which lend least support. to the beliefs he has 
previously been inclined to favor. It is so easy, as Darwin long 
ago warned us, to forget the evidence that does not happen to 
agree with our theories! 

Finally, I wish to express the profound hope of the Committee 
that numerous readers of the Yearbook will be stimulated by the 
very shortcomings and imperfections of the investigations reported 
to undertake researches which will be better planned, better con- 
trolled, more extensive, and therefore more conclusive in their 
results. It is by this standard, ultimately, that the success of our 
efforts must be judged. 


PREFATORY NOTE TO CHAPTER I 


This is the first of the series of contributions assembled in 
Part II of the Yearbook in which the attempt is made to measure 
the relative effect of heredity and environment upon achievement. 

The investigation reported by Mrs. Hollingworth and Miss Cobb 
is unique in one respect at least. Ordinarily, when seeking to 
measure the possible effect of superior intelligence upon educational 
achievement, a comparison is made between the school achieve- 
ments of pupils of average intelligence and those of pupils of 
superior intelligence or between the school achievements of pupils 
of average intelligence and those of pupils of inferior intelligence 
or perhaps between the school achievements of pupils of superior 
and of inferior intelligence. In any such comparison there enters 
a serious disturbing factor—that of the home conditions or the 
cultural surroundings or the educational opportunities of the two 
groups in question; these environmental conditions are usually 
more or less markedly different and they must affect, to an extent 
difficult to compute, the actual school achievements of the two 
groups. 

In the present investigation the two groups under comparison 
both possess markedly superior intelligence; the homes from which 
they come appear, from careful cultural rating, to be indistinguish- 
able; moreover, the actual school opportunities have been as care- 
fully equalized as human ingenuity could suggest. Under these 
conditions, there nevertheless appears a series of divergences in 
educational achievement, ranging from none at all in some types 
of school work to marked divergence in others. These divergences 
are presented in a series of tables and graphs and lead to the gen- 
eral conclusion that the possessor of an intelligence quotient of 
165 will distinctly surpass the possessor of an intelligence quotient 
of 145 in certain, relatively complex, lines of educational achieve- 
ment, even though the home and school surroundings of the two 
children are equal, the one to the other, in the various factors that 
may favor school progress. 


CHAPTER I 


CHILDREN CLUSTERING AT 165 1.Q. AND CHILDREN 
CLUSTERING AT 146 1.Q. COMPARED FOR 
THREE YEARS IN ACHIEVEMENT! 


Leta 8. HoLLINcwoRTH 
AND 
MARGARET V. COBB 
Teachers College, Columbia University, New York City 


The investigation here reported was formulated in 1922 to dis- 
cover whether children who, by available tests of intelligence, 
cluster around a mean at +8 P.E. will outstrip in achievement 
over a period of years those clustering at +6 P.E., provided im- 
portant factors other than I.Q. are equalized for the groups under 
comparison. 

It has been previously established with reasonable certainty 
that children who test by available methods at low degrees of in- 
tellect cannot learn as well as children generally can; and that 
those who test at high degrees of intellect learn much more rapidly 
and adequately than children in general do. These results have 
convinced many that innate educability is tested by these methods, 
and that it is now possible by such means te classify children for 
differentiated education, justly and humanely. Others have, how- 
ever, remained unconvinced, pointing to the positive correlation 
between degree of intellect in pupils on the one hand, and quality 
of parents, home, and school on the other. Since children who are 
dull by test and who learn poorly come usually from ‘unprivileged’ 
homes, it is inferred that not educability but educational oppor- 
tunity is the basis of both intelligence quotient and curve of learn- 


>This report is rendered as part of the work of a Joint Committee, for 
three years in charge of special opportunity classes for gifted children at 
Public School 165, Manhattan. The members of this Committee were Mr. 
Jacob Theobald and Miss Jane E. Monahan, of Public School 165, Miss Mar- 
garet V. Cobb, Dr. Grace A. Taylor, and Professor Leta 8. Hollingworth, of 
Teachers College, Columbia University. ‘The work was carried on with the 
advice of District Superintendent John E. Wade, and in codperation with the 
Division of Educational Psychology, of the Institute of Educational Research, 
at Teachers College. The work was supported by funds granted through the 
Institute by the Carnegie Corporation of New York. 
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ing. (Such reasoning would no doubt also explain the fact that chil- 
dren who ‘test high’ are accelerated in school status, by saying 
that they are bright because they are accelerated, rather than by 
saying that they are accelerated because they are bright.) 


SELECTION OF SUBJECTS 


In order to avoid the ambiguities arising when stupid children 
(who usually have inferior social and educational environment) 
are compared in learning with bright children (who usually have 
excellent social and educational environment), we here present two 
groups, both testing in the highest one half of one percent of the 
total population for intelligence as shown by test. From long ac- 
quaintanceship with the social and educational backgrounds of 
young children testing in the top one-half of one percent, we be- 
lieved that children in the highest one quarter of one per cent 
differ little if at all from those in the next to the highest one quar- 
ter of one per cent in such matters as excellence of home environ- 
ment (insofar as this can be measured in material terms) and of 
educational opportunity. Yet so wide is the range of intellect in- 
cluded in the highest one-half of one per cent, that it is possible in a 
school population of a million children, such as is found in the pub- 
lic schools of New York City, to collect two groups of pupils, both 
falling within this designated percentile status, yet separated from 
each other by a distance of about 2 P.E., or about 20 points of I.Q. 
In our selection of children we aimed at a mean of 145 I.Q. for one 
group, called our ‘‘Lower Group,’’ and of 165 J.Q. for the other, 
ealled our ‘‘Higher Group.’’ In matching the children we actually 
achieved a mean of 146 I.Q. for the one and of 165 I.Q. for the 
other. These compared groups were formed and finally selected in 
the following manner. 

Our Committee, to which reference has been made, was orgain- 
ized in the spring of 1922 to select and to conduct for three years, 
a special class of young children all testing at or above 150 I.Q. 
In searching for these children it was inevitable that a large per- 
centage of promising candidates should be found to test below 150, 
but above 135 I.Q. A group of such children was thus incidentally 
identified; and the suggestion arose to form a special class for them 
also, conducted with conditions and opportunities identical (except 
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as stated later in this discussion) with those planned for the group 
above 150 1.Q., with the purpose of ascertaining whether or not, 
among children all so highly intelligent, differences of scholastic 
achievement would appear in accordance with differences in I.Q. 
as determined by available methods. It seemed to us that groups 
selected as these were selected, would not differ from each other in 
achievement, unless intellect as measured is the chief determiner 
of such a difference. 

During the three years of experimentation in the special oppor- 
tunity classes thus initiated at Public School 165, Manhattan, the 
registers of both classes were maintained at slightly above 25 
pupils, which is the minimal register administratively permissible. 
Some of the children dropped out from time to time, and were re- 
placed by others, since a certain amount of ‘turnover’ is unavoid- 
able through change of residence and similar accidents. In May, 
1925, the investigators were able to list 40 children who had been, 
and continued to be, members of these special classes from the be- 
ginning of the experiment to its close, and who could be paired in 
the manner determined upon. Twenty of these tested from 152 to 
183 1.Q. (Stanford-Binet). Each of these 20 children was then 
paired with another child of the same age who tested from 1 P.E. to 
4 P.E. lower in I.Q. The Lower Group thus showed I.Q.’s ranging 
from 139 to 156. The mean I.Q. of the Higher Group proved to be 
164.9; that of the Lower Group, 146.1. 

During the three years of experimentation, each child had been 
tested annually by means of the Stanford Binet. The I.Q.’s upon 
which matching was based in May, 1925, are in every case the means 
of these three annual tests. The relative status of each child within 
the two groups is thus a little more reliably determined than would 
have been the case with one test only,? since the correlation of 
the Stanford Binet with itself on annual trials is not above .95. 

Our ideal was to match each pair for birthday age within a 
month, but in several cases this ideal was not quite realized. The 
difference between the groups in mean age gives a slight advantage 
of maturity to our Lower Group (see Table I). This advantage, 


7Tn at least one case the mean I.Q. is known to be a little too low, because 
the final I.Q. dropped, on account of the limitations of the scale at the upper 
extreme. This is the case of the child of Pair 1, whose I.Q. is 183 as the mean 
of the three tests. His first test at the age of 9 years yielded an I1.Q. of 190. 
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insignificant as it is, is ‘spread’ over the members of the Lower 
Group, and does not result from but one instance of a much older 
child in this group. In obtaining our 20 matched pairs, we ex- 
hausted the possibilities among the children available under the 
criteria adopted for matching. 

The children were 100 months old, on the mean, when selected 
for the classes in 1922. They were about 102 months old, on the 
mean, when they entered upon special opportunity in September, 
1922. They were 135 months old when matched for the compara- 
tive groups herein reported, and the mean age was 136 months 
when the experimental work was concluded at the end of June, 
1925. . 

Sex was not kept as a constant in matching pairs, but the groups 
had 5 girls and 15 boys in each of them. The two groups were 
practically identical as regards race; 18 children in each group 
were Jewish. Of the two non-Jewish members of the Higher Group 
one was Irish-German and the other, Scotch-Irish. If the two non- 


Jewish in the Lower Group one was French-English and the other, 
Scotch-Italian. 


TaBiLe I. SHowrna THE Marcuine BY AGE AND INTELLECT OF THE Two 
Groups CoMPARED IN EXPERIMENTATION 


Hiauer Group Lower Group 
Pair |——_—___ sd  — 
Age in Mos. | First I. Q. |Meani.q.) Age in Mos. | First I. Q. |Mean1I.q, 
1 Na eee 144 190 183 144 145 149 
aang 128 172 178 130 135 145 
Scare. 139 175 176 140 145 153 
4 121 166 173 125 149 156 
5 142 al 171 143 140 147 
Gan 138 171 171 139 141 141 
7 124 154 171 129 146 151 
Srreky 131 167 170 132 149 150 
9 140 162 165 139 139 150 
10 129 160 163 133 138 144 
1B eed 136 170 162 135 135 144 
Dee re 132 162 161 133 135 139 
13 137 160 160 135 142 146 
14 143 157 160 146 137 142 
Serre 127 163 159 132 143 148 
Goer 140 164 158 141 144 139 
veneers 142 156 158 147 141 145 
1b Sano 137 141 155 133 138 146 
LON 134 156 152 134 151 143 
PAY Re 131 154 152 134 141 144 


Mean 134.75 163.5 | 164.9 136.20 141.7 | 146.1 
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All of the forty children were selected from the public-school 
population of New York City, many districts of which were repre- 
sented. The extremes of residence were from the Bronx to Staten 
Island; from East Fourteenth Street to West Two Hundred and 
Sixty-Third Street; from Avenue B to Riverside Drive. The 
grouping by age and by I.Q. is shown in Table I. 

After the children were matched, in May, 1925, as shown in 
Table I, the achievements of the two groups as thus constituted 
were computed for the preceding three-year period of opportunity. 
Achievement was not taken into account at all in the matching, 
which was done solely on the basis of age, I.Q., and presence in the 
classes throughout the whole three years of experimentation. 


EQUALIZATION OF EpUCATIONAL OPPORTUNITY 
Time Spent in Graded Classes 


When the matching was completed, the investigators had before 
them two groups of 20 gifted children each, separated by a distance 
of about 2 P.E. on the scale of I.Q. (1 P.E. == 8 1.Q., according 
to Terman’s distribution). All of these children had first attended 
the regular classes of various public schools. It is of interest to 
us to know whether our two groups had already been differentiated 
in achievement by promotions received in these grades before enter- 
ing our special classes. 

After our two comparative groups had been selected and paired, 
we computed the mean school status of each group at the time of 
segregation in September, 1922. The result shows that the Higher 
Group then averaged .8 of a term (semester) higher in school grad- 
ing than did the Lower Group. The overlapping between the two 
groups in this respect was very marked. Relative grading was but 
slightly in accordance with mental age as regards individuals. For 
instance, the child who appears in our Higher Group with an I.Q. 
of 183 was but one year accelerated in school status. At the age of 
9 years he was in the 5th grade, though his M.A. at that time was 
17 years, 4 months. Others of much less intelligence had ‘skipped’ 
two and even three years, according to the policies and insights of 
their various principals and teachers. 
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Time Spent in Special Classes 


Both groups were segregated for special opportunity classes 
at the same early age and were thereafter taught in the same school, 
under the same administrative auspices. Here they both had access 
to the same special library, were taught under the same policy of 
freedom to progress according to capacity, and enjoyed all special 
and usual privileges and opportunities provided. 


Teachers 


Each child had had from two to six different teachers during 
the years spent in the graded classes. Each group had thus been 
subject to the influences of so many different teachers previous to 
segregation as to destroy the influence of any one of them as a 
constant upon the group. 

After segregation, each group was subject to the influence of 
seven different teachers, six of whom taught every child in both 
groups, impartially. The seventh teacher, however, was in each 
instance the classroom teacher, and this teacher did not teach both 
groups impartially. These classroom teachers were constant influ- 
ences for three full years, each for her own group, but not for the 
other to the same extent. It was administratively impossible to 
shift these two teachers in the middle of the experiment, as from 
the research point of view alone would have been desirable. All 
but three of the children in our Higher Group stayed with one 
teacher throughout the entire three years, and all but seven of those 
in our Lower Group stayed with the other teacher throughout. 

We have carefully scrutinized our data for evidence as to 
whether or not the influence of these two teachers might be the 
cause of differences in achievement between the two groups. We 
believe such influences to have been inappreciable for these four 
reasons: 

(1) Each group was placed with its teacher at the outset solely 
on the basis of I.Q.’s. And in September, 1922, before being taught 
by these teachers, the Lower Group was found to have achieved 
less in graded classes scattered over the city than had the Higher 
Group. Subsequent performance is, therefore, consistent with the 
trend of performance previous to experimentation. 

(2) The accomplishment quotients of the two groups would be 
expected to differ if the teachers differ in influence upon achieve- 
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ment. Table II presents the A.Q.’s of the comparative groups at 
four different periods during experimentation. The fact is that 
there is no reliable difference between the groups as regards A.Q. 
Such apparent difference as there is favors the teacher of the Lower 
Group. 


Taste I].—AccompiisHMentT QuoTients or THE Two Groups Durinae Ex- 
PERIMENTATION, ACCORDING TO THE STANFORD ACHIEVEMENT TEST 


Date A. Q. Higher Group | A. Q. Lower Group 
OLS) SLs} LSP Aik, al Sai sas try RIOTS 94.5 91.0 
Junet LO24 Wie its, aol 96.5 99.0 
Dept LOLA coe Seieewin dem Sere ee ese 98.0 100.0 
UNE eLODS ee ere eee ass 96.5 99.5 
FAVELA OMe. poiteee Reset eo meatoee 96.4 97.4 


(3) Furthermore, in September, 1924, nine of the forty pupils 
here involved were shifted from one classroom teacher to the other 
(a few very young children of higher I.Q. were moved to the teacher 
of the Lower Group, and a few older children of lower I.Q. were 
moved to the teacher of the Higher Group). Analysis of the per- 
formance of these children after shifting shows no consistent de- 
parture from the trend for the years preceding. These facts are 
presented in Table III in the form of A.Q.’s for these nine indi- 
viduals for a period of two years. 


Tasie III.—Tue A. Q.’s or INDIVIDUALS SHIFTED FROM ONE TEACHER TO 
ANOTHER IN SEPTEMBER, 1924 
(The first three cases in the table were moved from the teacher of 
the Higher Group to the teacher of the Lower Group. The other six 
were moved from the Lower to the Higher. The first two cases in the 
table belong to our Higher Group, the other seven to our Lower Group.) 


A. Q. (Stanford Achievement Test) 


Child LQ. 
Sept., 1923 | June, 1924 | Sept., 1924 | June, 1925 
APE cL aa ins eee 159 100 104 105 104 
Bene eie aces. 152 95 97 102 101 
(Cin Bey Ri romiee 156 hls 100 104 104 
DE ec ay eto 149 89 93 90 96 
LB eae Sees 145 102 101 106 106 
Liotta Ceong he 141 100 104 106 101 
CN eee tone. 142 97 100 100 100 
Neb ais eg, mee 139 89 97 99 99 
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(4) It would be difficult to show why, if the influence of the 
two teachers in question were the cause of differences in achieve- 
ment, that influence should affect some mental functions, but not 
others. For instance, it would be hard to explain why it should 
show itself sharply in reading, but not in multiplication of whole 
numbers; in spelling, but not in addition of whole numbers. 

From this evidence we infer that the influence of the two class- 
room teachers did not interfere in any appreciable way with the 
equalization of educational opportunity. While it is no doubt true 
that for individual pupils educational opportunity from birth may 
have been somewhat unequal, for the two matched groups it was 
surely equal in all essential and important respects. 


HoME CoNDITIONS 


Home conditions were rated by Miss Cobb. Rating was accord- 
ing to the Whittier Scale, with 25 points as a maximal score for a 
home. Table IV, showing the comparative home ratings for our 
20 pairs, reveals the excellence of the homes from which most of 
these very gifted children come.® 

Two of the children in our Higher Group have poorer homes 
than any child in the Lower Group. These two homes are de- 
cidedly ‘unprivileged,’ located in slum districts, crowded, and de- 
void of books. 


Pouicy oF INSTRUCTION 


The policy of instruction was to let each child learn at his own 
natural rate. Adjustments were made constantly according to 
individual advancement in any subject of study or project covering 
the prescribed curriculum, and also including in the daily pro- 


* As the Whittier Scale does not give scope for a very wide range of home- 
ratings, it does not seem surprising that it should fail to discriminate among 
homes of children all of whom fall in the top one half of one percent as 
regards I.Q. Our data show only that our 146 LQ. group has as many homes 
at or near the maximum for excellence as does the 165 I.Q group, by the 
criteria of the Whittier Scale. 

We were not surprised, in fact, that the ratings (made independently 
of the final composition of our two groups) should have yielded no discrimina- 
tion among the homes of children so homogeneous as to parentage that all rate 
in one half of a given percentile as concerns I.Q. We think it very probable 
that children limited to one half of any percentile on the distribution of I.Q 
would show homes rating alike. 


\ 
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TasBLte IV.—Comparative Home Ratinas ror 20 ExpERIMENTAL Parrs, OF 
HicHer anp Lower Groups, RESPECTIVELY, ACCORDING TO THE 
WHITTIER SCALE FoR Rating Homrs 


y Home Rating Home Rating 
Pair —_——______. Pair ——_——_______- 

H L H L 

Leste tels ates 25 25 Le ettiseiaees 24 24 
DART WO EINES Ct 25 25 1 Re nic sires Oe 17 24 
Cosa eer d 25 23 Loge iles cea 25 25 
BN Aes ack 25 25 TAN ees hire es 25 23 
Dose eee sion icy: 24 24 LD eka cit circ te 23 22 
On fe erecncte 25 24 1G Sin oeoniseme 17 20 
Maret Seisieteen nts 25 24 Tete tere sisters the 24 25 
SngolSodde saas 23 23 Bee eis es 23 24 
OQ eiralevs stele tere crs 20 20 Loe eehccetercetier 21 25 
10 : 25 25 QO Rie atelatte 25 21 

Meantneees 23.30 23.25 


gram much unusual opportunity for education. French, social 
science, the history of civilization, music, biography, physical train- 
ing, were all introduced to enrich the curriculum. These oppor- 
tunities were open to all alike. 

Under these conditions, the children learned as much as they 
would or could, and were tested at intervals to ascertain their 
achievement. Whenever individual difficulties were detected, help 
was given. All had equal access to this assistance. 


Trsts or ACHIEVEMENT 


Standardized tests of achievement were administered by Miss 
Cobb from time to time over the period of three years. Different 
equivalent forms of each test, where such exist, were employed on 
consecutive occasions. All children were tested by the same form 
on any given occasion. Every test was given and scored in exactly 
the prescribed manner, so that no further description of the testing 
is necessary. 

In such prolonged experimentation it is inevitable that a few 
children will be absent on some occasions of testing. It was im- 
possible to have all of these absentees make up every test imme- 
diately. Thus, a few records had to be interpolated. This was 
done, after the first three trials, by giving each absentee the highest 
score which he had reached (or bettered) three times previously. 
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For example, if a child, upon being absent, had three best scores 
of 6, 5, 6, on previous occasions, he was given a score of 5 on the 
occasion of his absence; one having best previous scores of 4, 4, 7, 
would be given a score of 4. 

Between 5 and 10 percent of scores in each group were thus 
inferred. As any error which might arise from this procedure 
would affect both groups equally, it could not exercise an invalidat- 
ing influence upon the results. 

On the Stanford Achievement Tests, however, there are no 
interpolated scores whatever. If a child was absent on the occa- 
sion of testing, he was followed up immediately and tested, with the 
exception of one child, in June, 1924. On this single occasion, the 
median of 19 instead of 20 children was found in the Lower Group 
(to which this absent child belonged). 


RESULTS 


The results of experimentation are presented by means of tables 
and graphs which seem to require no verbal description. In a 
few of the tests given, progress is not measured on the same scale, 
because different kinds of tests were used from occasion to occasion. 
In these cases the results are presented in Table V. All other tests 
represent progressive achievement, each on the same scale as a base 
from time to time, and are presented in the form of graphs. It will 
be noted that results from the miscellaneous achievements presented 
in Table V are in agreement with the data presented in the graphs, 
in that they show that the group testing at 165 I.Q. excels the group 
testing at 146 I.Q. in intellectual performances. 

Attention should be directed especially to the results from the 
tests of penmanship. There is no reliable difference between our 
groups in this achievement. It is safe to say, also, that they do not 
excel unselected children of their age in speed of penmanship, and 
excel them but slightly, if at all, in quality. 
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Thorndike-McCall Test C3 Combrehension m Reading 


Medien T Score f Children Clustering at 651Q(5-8) 
we Median T Score ef Children Clustering at 461QG-B) 


Maximum Score 2.0 


Ayres-Burgess lest of Sheed mRead ing 
o— Median of Group at 1651068) 


x — Median of roup 


at 14619 (SB) 


Woy 2 Apr. ay June July 14. Sept: Oct: Nov, Dec. Jan. 
1924; 
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Mowree Diagaestic Test 1: Addition of Whole Numbers 
eo Hedin & Children Clustermg at 65 KG® 
oa — —Medton of Cibiven Chatering at 146 1D. (5-8) 


Meare Diag restic Est 2: Subtraction of Whole Numbers Maximum Score 24 
o—Median of Children Clustering at 16519 GH) 


(49a 


rf Monroe Diagnostic Test 3: Multiplication of Whole Numbers Maximum Store 20 
a Medien of Children Chastering ab 165 10. 6-8) 

e~ diam ldren Chs er 
it shee ces ree) 
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She Monroe Diagnostic Fest 4 Short Division 
7 i: Median of Children Chast ering at 15 10.68) 


Mowroe Diagnostic Test 5: Addition of Whole Humbers 


@—Median of Children Chistering at 165 10-8) 
ye - Median of Children Chistering at 146 106-8) 


Munroe Diagmstic Test 6 Leng Division Maximum Seore 24 
o—Hedian of ChildrenChust ering at (65 1068) 
— Median of Children Clustering ot 4610 (Gey 


Mowroe Duagnostic est 7: Adiition of Whole Numbers 
o—Hedien of Children Clasteving at 165 106-8) 


e, 


SUPERIOR AND VERY SUPEBIOR CHILDREN COMPARED 17 


3 Moree Diagasticlest 8: Multiblication of Whole Numbers 
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More Diagnostic Fest 1. Adlition of Fractions 
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Moorea Diagnestic st 15: Addition of Fractions 
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Monroe DiagnosticTest Ig. Multiplication of Decimals Maximum Sesre 2+ 


o—Hedian of Children Clustering a 1651068) 
oe ~ Hodian ¢ Children Clustering at 145108) 


@ 


7 a Monroe Diagnostre Fest 9: Division ¢ Decimals Maximem Score 2¢ 
A : o—Median of Children Chistering ok 165 1QG-8) 
hee Median of Children Clustering at 1451968) 


Monroe Diagnostic Test 20: Multiplication of Decimals 
o—Hedian of Children Clustering ok 1651068) 
we dMedian ¢ Children Clustering at 451068) 


= 
3 
& 
$ 
a 
£ 
ee 
x 
2, 


SUPERIOR AND VERY SUPERIOR CHILDREN COMPARED 21 


absed 


of 


Stanford Achievement est: Favagrap Meaning 
Maximum Score i, re 


o—Medianof Group at 165 10 G8) 
a x — Median ge Group at 4610 68) 
we Norm forUnselected Children 


Bere 1923 ne, 1924 Sept. J24 


22 


THE TWENTY-SEVENTH YEARBOOK 


eaning 
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Stanford Achievement lest 4: Arithmetic Combubabion 


Maximum Score 188 


/ : o—Median of Group at 1651Q(58) 
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: ‘Stanford Achievement lest 5: Arithmebic Reasoning 
Maximum Score 160 


o—Medianof Groubat 1651068) | 
*- Medianof Group at 1461068) 
« e « Norm for Unselected Children 
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Stanford Adjereneublest 8: Language Usage 
Maximum Seore 60 
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op ey, 1923 , ptt] 192 3 
INTERPRETATION OF THE RESULTS 


It is immediately evident that, among the functions herein 
tested, comparative achievement varies greatly. In a few functions 
there is no advantage from the additional intelligence represented 
by 165 I.Q., as compared with 146 I.Q. In other functions the 
difference in favor of the Higher Group seems comparatively slight, 
but in most instances the difference seems pronounced. 

The question arises as to how we may compare the magnitudes 
of the differences found; how we may, for instance, compare a 
difference of one correct solution in Long Division (Monroe Diag- 
nostic Test 11) with a difference of ten points in T score in the 
Thorndike-McCall Test of comprehension in reading. 

We have adopted the following method of arriving at a rough 
estimate of the magnitudes of the differences found between the 
two compared groups. The time saved by the Higher Group in 
arriving at the point reached as a final score by the Lower Group 
has been calculated in months, and the saving thus accomplished 
has been taken as a sign of the magnitude of the difference. 

These calculations are but approximate, for several reasons. In 


the first place, it must be borne in mind that the testing of performance 
was discontinuous, so that we cannot tell just what the respective 
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achievements of both groups were at every moment in time. We know 
these only for certain sample moments, as indicated on the graphs. 
Therefore, to find the point in time at which one group equals the final 
achievement of the other, we have to calculate from the slant of the 
curve between two tested moments. This, of course, is not so accurate 
as the result from continuous testing would be. In the second place, 
there may arise in a few of the tests an error of handicap upon the 
Higher Group, from the fact that children having the highest I.Q.’s 
reached a top or nearly top score in some of the tests before the final 
trials. This error we have sought to avoid by our method of compari- 
son which cites the Higher Group at amy point in its career wherever 
it meets the final median score of the Lower Group. We have sought 
also to avoid it by taking the medians rather than the means of the 
groups as points of comparison. 

Similar comparisons of ‘time saved’ with the norms for un- 
selected children, where such are available, cannot be made, because 
neither of our experimental groups overlaps at all with unselected 
children during the time covered by the three years of experimenta- 
tion. In the Stanford Achievement Tests, for instance, our two 
groups at 9 years, 6 months, of age very much exceed the scores of 
ordinary children who are 11 years, 3 months, old (the age at 
which our experimental groups were tested for the last time). We 
have, however, included on the graphs the normal curves for un- 
selected children in achievements for which age norms are avail- 
able or can be inferred. 

Table VI shows how the various achievements compare with 
each other in the differences which they show between children 
clustering at 165 I.Q. and those clustering at 146 I.Q. according 
to the method of calculating time saved, which we have just de- 
scribed. The tests are listed in order of amount of time saved by 
the Higher Group, with those listed first in which most time was 
saved. It is interesting that the Higher Group saved time in every 
test of achievement but one, though the amounts vary from 16 
months to 0 months in various performances. 

In those functions or tasks which head the list for magnitude of 
difference between our groups, the curves of performance are never 
in contact from the time either group rises above zero. In those 
which fall at the bottom of the list there is practically no advantage 
realized by the Higher Group from its additional +2 P.E. of>in-~* 
tellect. Addition of Whole Numbers can be performed as well by 
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TaBLe VI.—Time Savep By THE Hiacuer I. Q. Groups in ATTAINING VARIOUS 
LEVELS oF ACHIEVEMENTS 


Months Saved by 


Test of Achievement Higher Group 


Word Meaning, Stanford Achievement 3................ 16 
Paragraph Meaning, Stan. Ach. 1 
Sentence Meaning, Stan. Ach. 2.................2.+.-<: 14 
Nature Study and Science, Stan. Ach. 6................. 14 
* Understanding of Paragraphs, Thorndike-McCall......... 13 
Addition of Fractions, Monroe Diagnostic Test 9......... 13 
ILO IDNs eM, Moy, IDG. ok Ack Saco soon aoocuoeue. 13. 
Multiplication of Whole Numbers, Mon. Diag. 8......... 13. 
Spelling from Dictation, Stan. Ach. 9................-.-: 12. 
iangusgerWsacens tania chs Suse eecdasiee-ciloes ccleaner ial 
History and Literature, Stan. Ac. 7..............c000:: 
Song Divisions Mons Diggalinn sane erence nce cee ae 
Subtraction of Whole INumberas Mont Diag 2naeeeee cece 
Division of Fractions, Mon. Diag. DGS. 5 Aree orate eer eye 
* Subtraction of Fractions, Mone Dia gw 3 circ ea cece oe air 
Speed in Silent Reading, Ayres-Burgess................- 
Arithmetic Computation, Stan. Ach. 4.................. 
+ Multiplication of Whole Numbers, Mon. Diag. 10........ 
Arithmetic Reasoning, Stan. Ach. 5...............0.00-- 
Multiplication of Whole Numbers, Mon. Diag. 3......... 
Addition of Fractions, Mon. Diag. 12................... 
#Subtraction of Whole N umbers, Mon. Diag. 9............ 
* Division of Decimals, Mon. Diag. 19.................... 
Short Division, Mon. ID TE Was USF sel CRE COO oCLG 3 
Multiplication ‘of Decimals Mon Diag See se ea. cela 
Multiplication of Fractions, Mon. Diag. 14.............. 
Division of Decimals, Mon. Diag. 17...................-- 
Division of Decimals, Mon. Diag. 21..................-- 
Multiplication of Decimals, Mon. Diag. 20............... 
Addition of Whole Numbers, Mon. Diag. 7.............- 
Addition of Whole Numbers, Mon. Diag. 1.............. 
* Addition of Whole Numbers, Mon. Diag. 5.............. 


In the instances where the final score of the Lower Group is not its best 
score, the comparison has been made on the basis of the best score as a final score. 


SmNONooCO OO OUNMNOOMUMNOOCOOCOONOOMMaUCO 


SOWWKERAMNAAMAONNNNAOOOCOCSO 


children of 146 I.Q. as by children of 165 I.Q., at the ages studied. 
How is this to be explained ? 

The failure of the Higher Group to draw away from the Lower 
Group in these achievements cannot be explained by saying that 
both groups had reached the maximum of proficiency, for at no 
point until near the end is there approach to a perfect score. The 
curve rises quite steadily from one occasion of testing to the next, 
but rises equally for both groups. 

Perhaps the explanation of these facts lies in the motor me- 
chanics of the operation. It probably is true that young children 
of I.Q. 146 can perform additions with whole numbers, like those 
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found in Monroe Tests 7, 5 and 1, about as rapidly as the eye and 
hand can execute the task; and that at and above this level, there- 
fore, increments of intellect give no advantage. Both groups of 
very intelligent children show, however, a decided advantage over 
unselected children, even in these operations; for Monroe’s norms 
are well below the performances here displayed, according to grade 
norms (and inferred age norms), both when children enter school 
at five years and when they enter at six years.* The point where 
such achievement ceases to differentiate degrees of intellect falls, 
therefore, somewhere between 100 I.Q. and 146 I.Q. 

What is meant by the interpretation here offered may be fur- 
ther illustrated by a very simple example. Ten-year-olds of I.Q. 
100 can tell their names as readily as can those of 1.Q. 150. So 
thoroughly proficient has each group become in this very simple 
task that each is limited in performance of it only by the mechanics 
of articulation, which do not differentiate intellectual groups of 
ten-year-olds above a certain very low level. 

In appraising all differences it is necessary of course to bear 
in mind the maximal possible score. This is given on the graphs. 
It will be noted that in some performances the Lower Group ap- 
pears to approach the Higher Group toward the end of the com- 
parison; whereas inspection of the data will show that so many of 
the Higher Group have in fact reached or so nearly reached the 
highest possible proficiency that the comparison is no longer wholly 
valid, because the Higher Group has exhausted opportunity. Herein 
lies one of the most important criticisms of available tests for pur- 
poses of experimentation such as this. Children as gifted as the 
best in our Higher Group, that is of I.Q. above 160, are ‘off scale’ 
by the time they are ten or eleven years old. We have tried to 
allow as well as we could for this handicap upon our Higher Group 
by adopting group medians, instead of group means, as our basis 
of comparison. 

SPECULATIVE INTERPRETATION 

From our series of achievements as they stand we infer that a 
difference of +2 P.E. in I.Q. above 146 I.Q. (S-B) at the ages 
studied shows itself in comparative performance in proportion to 
the complexity and difficulty of the task attempted. 


* We have been unable to gain precise information as to age norms for 
the Monroe Diagnostie Tests, 
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The few performances in which there is no demonstrable dif- 
ference between our two groups are relatively simple, as is shown 
by the fact that they have fewer elements and thus require fewer 
associations than tests which differentiate the groups. For instance, 
in addition of fractions (which clearly shows an advantage for the 
group at 165 1.Q.) there are all of the elements involved in addi- 
tion of whole numbers (which gives them no advantage) and many 
more besides. The former requires all the associations required by 
the latter, and other groups of associations as well. 

Also, it is true that average children can make a score above 
zero, at all the ages here studied, in addition of whole numbers, 
whereas in addition of fractions they do not rise above zero until 
they are about eleven years old. It may pertinently be said that 
average children are not taught fractions until that age, but is 
there not some reason for this? 

We infer that it would be possible to discover by experimenta- 
tion scores of tasks which, because of their comparative simplicity, 
would not differentiate ‘very high’ from ‘still higher’ intelligence. 
Experimentation with groups like ours affords a method of dis- 
covering which among the world’s tasks will clearly give advantage 
to the best one tenth of one percent of intellects. Among tasks tried 
by us the following seem to give most advantage to increments of 
intellect: abstraction of meaning from words, sentences, and para- 
graphs; addition of fractions; nature study and science; long 
division; spelling from dictation; and language usage. Among 
arithmetical processes, increments of intellect are more valuable in 
dealing with parts than in dealing with wholes; in subtracting 
more than in adding; and in dividing more than in multiplying. 

We know from our tables and graphs that equalization of op- 
portunity does not equalize our groups. There is throughout the 
three years a marked difference in achievement. It would be of 
great interest to determine, if possible, the comparative magnitudes 
of this difference at the beginning and at the close of the period of 
equal opportunity in special classes. This comparison is not, how- 
ever, possible by any method known to the present investigators. 
True comparison in terms of time saved at each extreme is impos- 
sible because the scales used are not in terms of equal units from 
beginning to end; and also because opportunity is restricted for 
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the best children toward the end of their climb by the limitations 
of the tests of achievement. 

Perhaps a word may be added here to call attention to the fact _ 
that the achievements in which the two groups differed most are 
precisely the ones in which opportunity had been most indisputably 
equal. The world of word, sentence, and paragraph meaning lay 
open to all alike, as presented, for example, by the special library 
of the opportunity classes, the public libraries of New York City, 
and the whole reading environment of the city as it pressed upon 
the children from every sign board, news stand, and other such 
agency for the presentation of the printed word. 


CoNCLUSIONS 


1. Children of the ages herein studied cannot be equalized in 
achievement by equalization of opportunity. Those who test 
higher by available tests of intelligence, are subsequently found 
to stand higher in achievement both at the outset of a three-year 
period and throughout the whole three years of equal and special 
opportunity. 

2. A group of children 8 to 11 years old clustering at 165 1.Q. 
(Stanford Binet) differentiates itself from a group of the same age, 
clustering at 146 I.Q:, in performances covering intervals of time 
even as brief as 30 seconds. 

3. By tests of intelligence at present available educability is 
tested. Comparative achievement under equal opportunity can be 
foretold from these tests with a high degree of reliability. 

4. In the course of experimentation it is found that mental 
tasks differ widely in the extent to which they utilize increments 
of intelligence. There are scholastic performances which do not 
profit from the margin of intellect represented by the difference 
between 146 I.Q. and 165 1.Q. 

5. Apparently, the differences in achievement between the two 
groups herein compared increase in magnitude as the elements or 
associations involved in the tasks increase in number; or in other 
words, as the tasks increase in complexity. 

6. Children clustering respectively at +6 P.E. and +8 P.E. 
in mental tests, stand far above unselected children in all perform- 
ances herein studied, with the exception of penmanship. This 
superiority of achievement is not to be attributed to the special 
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opportunity of the former, since those at +8 P.E. also outstrip 
those at +6 P.E., although special opportunity was equalized for 
these two groups. 


7. Children clustering at 165 I.Q. do not differ appreciably or 
consistently from children clustering at 146 I.Q. as respects home 
ratings (all children were drawn from the public school population 
of the same city). Nevertheless, there is a wide difference in 
achievement between two groups thus constituted, under conditions 
of equal opportunity in the school. 


APPENDIX 
Sources of Tests Used in Experimentation 


Ayres-Burgess, M., A Scale for Measuring Ability in Silent Reading. 
Russell Sage Foundation, New York City 

Buckingham, B. R., Extension of the Ayres Spelling Scale. Public 
School Publishing Co., Bloomington, II. 

Chapman, J. C., and Cook, S., Speed of Reading Tests (Grades IV to 
VIII). J. B. Lippincott Co., Philadelphia, Pa. 

Horn, E., and Ashbaugh, E. J., Spelling Tests, First Series. J. B. Lip- 
pincott Co., Philadelphia, Pa. 

Kelley, T. L., Ruch, G. M., and Terman, L. M., Stanford Achievement 
Test. World Book Co., Yonkers-on-Hudson, New York 

Monroe, W. S., Diagnostic Tests in Arithmetic. Public School Publish- 
ing Co., Bloomington, III. 

Monroe, W. S., Standardized Reasoning Test in Arithmetic. Public 
School Publishing Co., Bloomington, Ill. 

New York Scale for Grading Penmanship, Bureau of Research and Ref- 
erence, Department of Education, New York City 

Terman, L. M., The Measurement of Intelligence. Houghton Mifflin 
Co., Boston, Mass. 

Thorndike, E. L., and McCall, W. A., Reading Scale. Teachers Col- 
lege, Columbia University, New York City 

Thorndike, E. L., J. E. R. Graded Test of Word Knowledge. Institute 
of Educational Research, Teachers College, Columbia University, 
New York City 

Trabue, M. R., Nassau County Supplement to the Hillegas Scale for 
Measuring the Quality of English Composition. Teachers College, 
Columbia University, New York City 

Van Wagenen, M. J., American History Scales. Teachers College, 
Columbia University, New York City 

Williams, J. H., A Guide to the Grading of Homes. Whittier State 
School, Calif. 

Willing, M. H., Scale for Measuring Written Composition. Public 
School Co., Bloomington, Ill. 


PREFATORY NOTE TO CHAPTER II 


The data reported by Heilman show beyond reasonable doubt 
that not over five percent of the enormous variability in school 
product to be found among children of the same age in ordinary 
communities (or at least among children of age 10, the age of the 
children of this study) is due to differences in educational exposure. 
The application is readily enough drawn that to deny school pro- 
motions to children solely because of absence (unless extremely pro- 
longed) is liable to work an injustice upon pupils and to tax school 
facilities unduly. Regardless of shortage in the educational process 
of months, or even of a year or two, children tend to attain a 
proficiency in school subjects close to that appropriate to their 
mental level. We are reminded of Breed’s experiments in which 
young chicks that were prevented from learning to peck grain 
during the first few days of life became, when released, as pro- 
ficient almost at once in the pecking response as control broods 
which had practiced pecking from the time of hatching! Results 
such as those of Heilman open the question as to whether eight 
years of school attendance is really necessary to bring pupils up to 
the standard usually achieved by the eighth grade. One wonders 
whether in the four or five years between ten and fourteen they 
might not learn to read, write, and spell as well, and master as 
much arithmetic, history, and geography as they would be likely 
to in eight years. 

The author’s use of the Wright ‘‘path-coefficient method’’ is of 
great interest, since this technique has not been applied to educa- 
tional problems previous to the publication of the present Yearbook. 
The author’s cautiousness in the application of this method, which 
offers a basis for hypothesis no more sure than does any other tech- 
nique having less mathematical precision, is especially to be 
commended. 
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CHAPTER II 


THE RELATIVE INFLUENCE UPON EDUCATIONAL 
ACHIEVEMENT OF SOME HEREDITARY AND 
ENVIRONMENTAL FACTORS? 


J. D. HEILMAN 
Colorado State Teachers College, Greeley, Colorado 


PURPOSE 


The specific purpose of this investigation was to determine, in 
so far as its nature and the development of statistical technique 
permitted, the relative influence upon scholastic achievement of 
mental age, school attendance, and socio-economic status of the 
home, with the hope of throwing some light upon the relative weight 
of inborn capacity and environmental influences in determining 
the large individual differences in acquired abilities. In this study 
acquired abilities are represented by scholastic attainment as de- 
termined by the Stanford Achievement Test, environmental influ- 
ences by school attendance and socio-economic status, and inborn 


* This investigation was made in response to a request from Dr. L. M. 
Terman for a contribution to the present Yearbook. I am indebted to Dr. 
Terman for assistance in planning the investigation, to Miss Barbara Burks 
for help in the employment of statistical techniques, and to both Dr. Terman 
and Miss Burks for aid in making an interpretation of the results. The plans 
for the investigation were completed in October, 1925. 

The investigation was made possible through the scientific interest of 
President G. W. Frasier, Superintendent J. H. Newlon, and Deputy Superin- 
tendent A. L. Threlkeld. The entire study was financed by the Colorado State 
Teachers College and the Denver Public School System at a total cost of $1,000. 
All of the data were collected in the public schools of Denver under the super- 
vision of Charles EH. Greene, the Director of Measurements. 

A number of paid assistants were engaged to collect data, administer and 
score the tests, and tabulate the results. The tests were given by two indi- 
viduals who had received excellent training and practice for the work. All 
of the computations were made by me. I wish to express my gratitude to all 
those who gave assistance in any way in making this investigation. 

I also wish to acknowledge my indebtedness to Dr. F, L. Whitney, Director 
of our Research Department, for permission to use some of the results of 
an investigation which is now being made by him and his Assistant, Wilford H. 
Woody. I am further obliged to Miss Henriette K. Woolf for permission to 
use some of the results of her master’s thesis. 

In this article only those results will appear which have a direct bearing 
upon the problems of the present Yearbook. I expect to publish elsewhere 
additional results which it is hoped will be of practical value to education and 
its administration. I also intend to publish the re-standardization of the 
Chapman Socio-Economic Scale which was used in this study. 
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capacity by mental age as determined by the Stanford Revision of 
the Binet Scale. 

The solution of this problem is of the greatest practical import- 
ance to educational and eugenic measures. If native capacities and 
training, or environmental influences of the school and home, have 
an equal share in accounting for differences in abilities, then there 
should probably be a well-balanced emphasis upon educational and 
vocational guidance on the one hand and training on the other; but 
the emphasis should be shifted to guidance if inheritance is the 
important factor and to training if the influence of the environ- 
ment plays the more important réle. At present, especially in our 
high schools and colleges, the factor of guidance is receiving more 
and more recognition. We are, therefore, in great need of scien- 
tific guidance in our educational practice. Similarly, we are unable 
to deal wisely with the problem of the application of eugenic 
measures without reliable knowledge concerning the relative influ- 
ence of nature and nurture. 


GENERAL PLAN 


A large group of children, conforming as nearly as possible to 
the statistical requirements for accuracy, was selected for measure- 
ment. For each child included in the investigation the following 
data were obtained: 

1. Educational age in months 

2. Mental age in months 

3. Life age in months 
4. School attendance in days for each grade (and for the 
kindergarten) 

Socio-economic status of the home, in terms of S.D. 
Date of entering the first grade (and of the kindergarten for 
those children who had kindergarten training) 


nn 


Educational age was taken as an effect which it was assumed 
was caused by environmental and hereditary factors. The environ- 
mental factors are represented by school attendance and the nature 
of the home, and the hereditary factors by mental age. No claim 
is made that all of the environmental influences upon educational 
age are included in school attendance and the socio-economic status 
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of the home or that all of the hereditary influences upon educa- 
tional age are embraced in what is called mental age. Whatever 
influences have been measured, it is proposed to determine their 
relative potency in producing variation in educational age under 
the existing school and social conditions of the communities from 
which the data were obtained. 

This may be done by making use of the fact of variation. We 
found a very large individual difference in the effect, or educa- 
tional age—represented by the amount of variation. Similar indi- 
vidual differences, or variations, were obtained for the factors 
assumed to be causes of the effect. Now, the relative potency of our 
causal factors in producing educational age can be determined by 
showing which of them contributes the largest portion to the 
amount of variation in educational age. We may say that school 
attendance is the most potent factor if a given amount of change in 
it produces a more marked change, or variation, in educational age 
than an equal amount of change in mental age and in socio-economic 
status will produce. If, on the other hand, variation in educational 
age is determined more by changes in mental age than by equal 
changes in either of the other factors, then mental age is the more 
potent in influencing educational age. 

In reading this article it is important to remember that such 
results as have been obtained do not apply to widely different situa- 
tions. Larger variations in any one of the causal factors would, 
no doubt, have increased its weight in contributing to the amount of 
variation in educational age. Life age for our group of children 
was restricted to an extreme variation of only nine months. This, 
undoubtedly, reduced the variation in educational age, mental age, 
and school attendance. Moreover, the children were selected from 
schools located in communities of average social standing, and this 
no doubt reduced the variation of socio-economic status. However, 
on our measuring instrument for this factor, the highest and lowest 
possible scores were made. 

On account of the very narrow range of life age, the influence 
of this factor upon the variation of each one of the other factors 
was eliminated by means of partial correlations. The standard 
deviation for life age was only 2.6 months, and the coefficients of 
correlation for life age and educational age and life age and mental 
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age were only .185 and .120, respectively. No attempt is made, 
therefore, to determine the relative significance of life age in con- 
tributing to educational age. 

For the purpose of actually determining the relative weight of 
the causal factors in producing variation in the effect, the correla- 
tion technique was employed. 

The technique most commonly employed for the purpose of 
finding the proportionate contribution of several causes to an effect 
is the partial and multiple correlation technique. This technique? 
is adequate only if the factors whose true correlation is sought do 
not contribute to the factors which are rendered constant. But 
for our factors, or variables, this assumption cannot be justified, 
because mental age may be expected to contribute to socio-economic 
status through the resemblance which children bear to their parents 
in mental age. If this is the case, then the true correlation between 
educational age and mental age is reduced too much by rendering 
socio-economic-status constant, because this reduces the size of the 
coefficient, not only by reducing the variation in educational age, 
but also, in violation of the principles on which the technique is 
based, by reducing the variation in mental age. If mental age con- 
tributes to school attendance, then the true correlation between 
educational age and mental age is again reduced too much by ren- 
dering school attendance constant. 

A technique somewhat better adapted to the solution of our 
problem is that of path coefficients, suggested to me by Miss Burks. 
By this method, which will be described more fully in subsequent 
pages, the direct contributions to the effect made by the different 
causal factors are also reduced too much, but the reduction, instead 
of being entirely lost, appears in the joint contribution made by the 
causal factors to the effect. The method is also far less laborious 
than the partial and multiple correlation technique. 

Our problem presents other difficulties which are probably of 
minor importance. For example, if the factor taken as the effect 
and any one of the factors taken as causal have a reciprocal influ- 
ence upon each other, no correlation technique can determine how 
much variation in the factor taken as the effect is produced by the 


* Burks, Barbara 8. ‘‘On the inadequacy of the partial and multiple cor- 
relation technique.’’? Jour. of Educ. Psych., 27: Nov., 1926. 
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factor taken as the cause. Now, in this investigation, I am of the 
opinion that grade placement and effort are factors which bear 
such a reciprocal relation to educational age. Just what their 
significance is, it is impossible to tell. However, they have been left 
entirely out of the computations. 

It is also possible that school attendance and mental age, socio- 
economic status and mental age; educational age and school attend- 
ance, and educational age and mental age bear a progressive re- 
ciprocal relation to one another. Under such conditions the corre- 
lations obtained would hold precisely only for children who have 
reached a stage in their chronology represented by the age of the 
subjects of this study (ten years). Moreover, in the event of re- 
ciprocal action between educational age and any of the other varia- 
bles, we could conclude only that the relations uncovered by the 
statistical treatment employed indicated a probable upper limit 
which the true causative effects of certain variables upon the cri- 
terion would not exceed. For example, later on, where the state- 
ment is made that about 45 percent of the variability in educational 
age is due to mental age, about five percent to school attendance, 
and about nine percent to the combined effects of mental age and 
school attendance, a more cautious interpretation might waive the 
idea of causation and state merely that mental age and educational 
age have a considerable degree of influence upon one another, and 
that educational age and school attendance contain only a small 
degree of mutual implication. The direction of influence may be 
partly or even largely from educational age to mental age and 
school attendance, rather than from school attendance and mental 
age to educational age, though the latter seems the more probable. 
But it is certainly significant to know that if differences in school 
attendance could be equalized experimentally without simultane- 
ously altering the composition of the group as regards age, mental 
age, and socio-economic status, the variance of the group in educa- 
tional age would not be decreased by more than thirteen percent, 
and possibly by less. 

Another point in which caution of interpretation must be used 
is connected with the assumption that the variables whose ‘effect’ 
upon achievement we seek to measure account in themselves for the 
influences found. No one could prove without a fundamental and 


40 THE TWENTY-SEVENTH YEARBOOK 


carefully controlled experiment whether or not an apparent 
‘unique’ contribution of socio-economic status to educational age 
was due actually to environment or to definitely heritable, but un- 
measured factors, other than Binet intelligence, which are corre- 
lated with socio-economic status. Similarly, no one could prove 
that the apparent contribution of Binet mental age was due en- 
tirely to intelligence, or in part to unmeasured factors, such as 
interest or effort, which are probably correlated with intelligence. 
In view of such considerations, it should be doubly emphasized 
that the relations found in this study represent only probable upper 
limits beyond which the true influences of the measured ‘causes’ do 
not lie. 


THe Data 


The subjects of the investigation were 828 school children who, 
on March Ist, 1926, ranged in age from 10 years, 1 month, to 10 
years, 10 months. All but 50 of these children had attended no 
other than the Denver schools. These 50 had not attended any 
other schools more than two years. All children who had foreign 
parentage and whose school records were incomplete were excluded 
from the investigation. All other 10-year-olds in each of the 48 
schools involved were included in the investigation. It is believed 
that this is an unselected group. Thomson and Pintner® say ‘‘one 
eriterion for the unselected nature of any group is that the co- 
efficient of correlation of age and mental age in such a group should 
be approximately equal to the ratio of the coefficients of variability 
of chronological and mental ages.’’ For this group the coefficient 
of correlation of age and mental age is .120, and the ratio of the 
coefficient of variability of age to that of mental age is .142. These 
figures, accordingly, meet the requirement of approximate equality. 

Practically all of the data were collected by the two examiners 
already referred to, Mrs. Gladys May Maclin and Mrs. Dallas 
Allhands. They began their work on January 25, 1926. On the 
same day Superintendent Threlkeld sent a letter to all of the prin- 
cipals and directors of the schools involved in the investigation, in 
which he explained the nature and purpose of the investigation 


* Thomson, Godfrey H., and Pintner, Rudolph. ‘‘Spurious correlation and 
relationship between tests.’’ Jour. of Educ. Psych., 15: October, 1924. 
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and asked them to codperate. As outlined by Mr. Greene, the Di- 
rector of Measurements, the examiners first listed the names of all 
the children who were born between May 1, 1915, and January 
30, 1916. They next determined which of these children had at- 
tended no other than the Denver schools. The next step was to send 
questionnaires to the parents to secure information in regard to the 
homes. This was followed by giving the achievement tests. The 
remainder of the time was devoted to the giving of mental tests. 

All of the age and attendance data were taken from the records 
of the Denver schools. The occupation of the father was also taken 
from the records. These were copied on blanks especially prepared 
for the purpose, of which the following is a sample: 


BLANK FOR AGE AND ATTENDANCE DATA 


Fill out this blank only for children whose birth dates lie between 
May 1, 1915, and January 30, 1916, inclusive. Do not include children 
of foreign parentage and children whose records are incomplete. 


DV SIOROIE CHIL Fete oriee > cc.ere (60,10 0(e SSPATCM CALC ters a aisiesielevetes Daten 
ICHIOOI Par Peat nts, states os eighecle cae viele aie k eo Grader ue cssionsemrcoseicouts 
Date Onentenar tie Kinderyarteni iss so sls «sic 4. sisis's 6.0 aoe oes a tsie aes weirs 
Watenofsetiterme the itst Orades, ss /sterw s\sie'e o's. aveeie wiesis tie ae cieice save os 


List the attendance in days in each of the following: 


Kinderodstermtar .atentases et Cita dey LI 5y dove 001 s'ee Grades VIt.e.n can scicnts 
Grade: Weweraae icc ects sees Grade LV amuence dans Grade’ Vil nareee son: 
(srade liv ee cra. see oes be GradenVigtteasters ssc Grade Vill tens eaccee 
CCCUDALION RO Le ate Ao. ateieler<'e eave sicin's tistes.eisiers » #eha-e'wielele ae) ste gre wieesteege 


The educational age was based upon the results of the Stanford 
Achievement Tests. These tests were given during the two weeks 
preceding and the two weeks following March 1, 1926. As the 
probable error on this test for ten-year-olds is 1.5 months, no cor- 
rection was made for differences in the time of administration. All 
of these tests were scored and the results tabulated by experienced 
students of the Colorado State Teachers College under the super- 
vision of an assistant in the Department of Psychology. 
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The mental age of the children was determined by means of 
the Stanford Revision of the Binet Scale. This test was selected 
in preference to group tests because it is practically free from the 
influence of school training. The tests were given from the middle 
of March to the end of the school term about the middle of June. 
By means of the I.Q. the mental ages of the children were all cor- 
rected to March 1, 1926. Both assistants who administered the 
tests were experienced in Binet procedure. Mrs. Maclin received 
part of her training in giving Binet Tests from Dr. Terman and 
part of her training from Mr. Greene in a summer course at the 
University of Denver. Mrs. Allhands received her training at the 
University of Denver under Dr. Garth. 

The method of checking on the tests was as follows: A small 
number of tests, from six to twelve, were first given by each exam- 
iner, then checked by both; later, they were checked by Miss 
Florence Barney, who does most of the Binet testing in the Denver 
schools and who is well trained and very capable in this work. 
Each week of the eighteen Mrs. Maclin and Mrs. Allhands checked 
their method of scoring and administering the Binets. 

The attendance of the school children was determined up to the 
end of the first semester, January 25. As the achievement tests 
were given around the first of March, no large discrepancies in 
attendance could have occurred between these two dates. Large 
errors in attendance might presumably have affected the scores on 
the achievement tests. 

The cultural and economic data, exclusive of the father’s occu- 
pation, were obtained from the home by means of a questionnaire. 
The data and the method of securing them are sufficiently ex- 
plained by the following letter and ‘‘ Question Blank for Parents.’’ 
If Item 12 of the blank—‘‘Do you live in an apartment house ?’’— 
was answered in the affirmative, Item 18 concerning the number of 
rooms in the house was not used in determining the score. 


TO PARENTS: 


The National Society for the Study of Education has asked the 
Denver Public Schools to codperate in making a study of normal ten- 
year-old children to discover the importance of regular attendance. 
Most of the necessary information can be secured from school records. 
The information which is asked for below may seem to be of a very 
personal nature, but it has a bearing on educational results. It will be 
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used only for this study and by those who are interested in the results 
from the city as a whole. It will not affect the school standing of your 
child and will not be seen by the teachers. The forms will be destroyed 
as soon as the results are compiled. 


It is important that the blanks be filled out accurately and com- 


pletely. We appreciate your coédperation. 


Cuartes E. GREENE, 
Director of Measurements 
Denver Public Schools 


DENVER PUBLIC SCHOOLS 


AN INVESTIGATION RELATING TO POSSIBILITIES 
AND LIMITATIONS OF TRAINING 


QUESTION BLANK FOR PARENTS 


Name rOlacuildetn Siete Je ce veces tas Dateres weet. Schools. cm ce eck 
T.20 you Have atelepnone in your NOME? foc os 2.4 oe «e's noe Nee ee soe 
Ze Didethestatherror the childsattend hiohwschool re. qe. «ene see ee 
3. Did the mother of the child attend high school?................. 
4. Is English the only language spoken in the home?................ 
Rls vOut Ouse Neated DY,a AUELACE N's asraesermgiers Be elciela's oust nye nia este 
(me owou Uave all auto otner tian 2 track? ss... ss. as sled sae eae 
Taw Oey OUstia Veda, PIANO IN VYOUT NOME fos hats elsle sae bie atennraielesls ee rks 
Sm Do you have a pnosorrapn in yout Home? yo... 6% «mise mglcle en es 
9. How many daily papers do you take regularly in your home?...... 

10. How many magazines do you take regularly in your home?....... 

11. About how many books are there in your home?..........s.eeee 

{2.5 Dopyouw live in ancapattment house ?. 2: ...5 6 an sea ce es sesamiae s 

1z. How many rooms, not including bathroom, are there in your 

DOMON  airekie walteniels 


CY 


(Name of parent) 
Note—Please put this question blank into the attached envelope. 


Seal the envelope and ask your child to hand it to his teacher to-morrow 
morning. 
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The information asked for on these blanks was complete for 688 
cases. The data on all of these cases were used for the purpose of 
making a socio-economic scale. The technique used in making the 
scale differed only in minor particulars from that used by Chapman 
and Sims.* Their scale was not used because we included two items, 
number of rooms and occupation, which do not appear in their scale. 
Moreover, the item on the number of magazines had to be treated 
differently to make its inclusion significant. The scores on each of 
the items are expressed in sigma times ten. The lowest score that 
can be made on the entire scale is 578, and the highest 702. As the 
information was incomplete for 140 cases, owing chiefly to the 
decease of the father and residence in apartment houses, the total 
scores made on the 13 items were averaged and the result was multi- 
plied by 10. This reduced the lowest possible score to 445 and the 
highest possible score to 540. 

The reliability of the scale was determined by computing a co- 
efficient of correlation between the halves of the scale. The scores 
made on Items 1, 2, 8, 9, 11, and 13 formed one half, and the scores 
made on the remaining items the other half. The attempt was made 
to pair the items in such a way that those in the one half would 
represent as nearly as possible the same things as the items in the 
other half. The coefficient for the two halves is .77; that between 
the results of this scale and those of a similar one, .87. Corre- 
sponding figures for the Chapman-Sims scale are .63 and .77, re- 
spectively. The coefficient of reliability for the revised scale is 
somewhat higher than it should be because about 50 of the 688 
children made perfect scores. 

The schools involved in the investigation were selected as nearly 
as possible in districts of average economic status. This was done 
to equalize as much as possible the socio-economic factor. However, 
the results of the scale show that every grade of socio-economic 
status provided for by the scale was represented in all of the schools 
combined. 

The investigation by Miss Woolf* was patterned after this one and 
is almost an exact duplicate of it. The same tests were used. The 


“Chapman, J. C., and Sims, V. M. ‘‘The quantitative measurement of 
certain aspects of socio-economic status.’’ Jour. of Educ. Psych., 26: Sep- 
tember, 1925. 

*Woolf, Henriette K. The Relative Influence of Some Environmental 
Factors and Mental Age on Educational Age. Unpublished Master’s Thesis, 
August, 1926. Colorado Teachers College, Greeley, Colo. 
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Chapman-Sims® Scale was changed so as to include “number of rooms” 
and “the father’s occupation” and was then re-standardized with the 
Chapman-Sims technique. The subjects were all of the 10-year-old chil- 
dren of the Greeley Schools. Computed to March 1, 1926, the children 
ranged in age from 9 years, 9 months, to 10 years, 9 months, with an 
average age of 10 years, 2 months. Miss Woolf did all of the work 
personally, for which she was well prepared, having taken her A.B. de- 
gree at Stanford University, where she majored in the Department of 
Psychology. Perhaps the most significant difference between the two 
investigations is the difference in the number of cases—her cases num- 
bered only 157. In interpreting her results she employed the partial 
coefficient and regression equation technique. However, she used the 
score-form of the equation and weighted the coefficients with the size of 
the arithmetic means. I changed this form of the equation into the 
special deviation form with the sigmas equal, and computed a similar 
equation with my data in order to make the results of our investiga- 
tions comparable. 

Dr. Whitney is at present engaged in making a very comprehensive 
study of “The Selection, Achievement, and Advancement of Secondary 
School Children.” In making this investigation he used the Stanford 
Achievement Scale, the Multi-Mental Scale and the Chapman-Sims 
Scale to test a very large sample of children of the state of Colorado. 
In addition to these measurements, he obtained data on school training 
(number of years in school), distance from school, physique (departure 
from normal weight for a given height and age), permanence of resi- 
dence, and life age. 

In connection with the general investigation, these data were ob- 
tained for the ninth-grade children in the over-three-teacher schools of 
Weld County, Colorado. In the statistical treatment of these data, par- 
tial correlations and the regression equation in deviation form with 
sigmas constant were employed. For the ninth-grade girls the method 
of path coefficients was also used. Educational age was taken as the 
dependent variable. As the data were not complete for all of the chil- 
dren, the zero-order correlations for the boys involved from 233 to 292 
cases, and for the girls from 228 to 308 cases. 

The Whitney-Woody investigation, however, differs from this one 
and Miss Woolf’s in important respects. The children were much older 
and differed far more widely in life age—the coefficient of variation for 
their group of girls was about 3 times the size of the coefficient of vari- 
ation for the group of children of this investigation. The basis of select- 
ing their children was that of membership in the ninth grade in the 
over-three-teacher schools of Weld County. More factors were included 
in the list of independent variables and mental age was measured by 
means of a group test. The attendance was only roughly determined 
by having the teachers report the number of years each child attended 


*Chapman-Sims. Loc. cit., page 13. 
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school. In case a child attended different schools, the teacher of the 
school in which the child was located at the time of the investigation 
wrote for the child’s attendance records in the schools previously at- 
tended. Moreover, the majority of the children were rural, belonging 
to the farming class. It is, therefore, to be expected that their results 
will agree less closely than Miss Woolf’s with those of this investigation. 


RESULTS 


1. Effect of Kindergarten Training 


On account of the somewhat prevalent belief that early training 
is far more efficacious than later training, I have compared the 
scholastic attainments of the kindergarten with the non-kinder- 
garten trained children of our group. Of the 828 ten-year-olds, 205 
had no kindergarten training, 615 had attended the kindergarten 
from 20 to 185 days with a median attendance of 132.5 days, and 
8 had attended fewer than 20 days. The records of the 8 children 
who had attended fewer than 20 days were not included in the 
computations. ; 

In Table I a comparison is made on the basis of the average of 
the EA, MA, SA, and SES of the kindergarten and non-kinder- 
garten groups. 


Taste I.—Averace Epucationan Acr, Menrat Agr, ScHoon ATTENDANCE, 
AND Socio-Economic STatus oF KINDERGARTEN AND 
Non-KINDERGARTEN GROUPS 


Aver. E. A. | Aver. M. A.| Aver. S. A. | Aver. S. E.S. 


in months in months in days in sigma 
Kindergarten......... 139.69 134.39 701.48 502.07 
Non-Kindergarten..... 137.56 132.67 655.98 494.63 
Wifferencen estes 2.13 1.72 45.50 7.44 


The kindergarten group surpasses the non-kindergarten group 
by two months in educational age, but in mental age it also excels 
and by almost the same amount. Moreover, in school attendance 
and socio-economic status the kindergarten group excels by 45.50 
and 7.44 points, respectively. In order to determine the signifi- 
cance of these differences, I computed the S. D. of the difference 
for each of the four items. ‘These are given in the following 
tabulations: 
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Tass II.—S. D. or ran Dirrerences BeTWEEN THE KINDERGARTEN AND Non- 
KINDERGARTEN Groups REpPorTED IN TaBun I 


Chances in 
} Diff. + 100 that 
8.D. (dis) |S.D. (aver)|S.D. (diff) | S.D. (diff) | Diff. is 
Greater 
than 0 
Educational Age 
Kindergarten...... | 14.69 | 592 | yi | 1.68 96 
Non-Kindergarten..| 16.17 TERC2 OWA AE eoeenticae acon annie Sean thes [loner are 
Mental Age 
Kindergarten....... 19.36 .781 | 
Non-Kindergarten. | 19.78 | 1.381 | 1.59 1.08 | 86 
School Attendance 
Kindergarten.......] 98.99 3.992 | 
Non-Kindergarten. .| 116.94 8.167 9.09 5.01 | 100 
Socio-Economic Status 
Kindergarten....... 27.61 ih, Tals 
Non-Kindergarten. | 27.50 | 1.921 2.22 | 3.35 | 100 


If the differences between the averages given in Table I are to 
be considered significant differences, they should be at least three 
times the size of the S. D. of the differences. As shown in the 
fourth column of Table II, this is not the case for either educa- 
tional age or mental age, but it is the case for both school attend- 
ance and socio-economic status. There is, therefore, no significant 
difference between the two groups in either educational age or 
mental age, in spite of the fact that there is a significant difference 
between the groups in school attendance and socio-economic status, 
in which the kindergarten group excels. Despite the fact that the 
kindergarten group attends school better and has a better socio- 
economic status, it does not give evidence of a real superiority in 
either educational age or mental age. 

Another method of making these comparisons is to compute, by 
means of the bi-serial r method, coefficients of correlation between 
kindergarten attendance and each one of the four variables. The 
results of these computations are set forth in the following 
tabulation : 
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Coefficients Standard Error 


TARA ies tc shoal ste alinierers 5 0834 0498 
KAMER seer tare sc ot 0521 .0500 
TASS Aas ieee rte caiponts 2517 0478 
OAS S BIS eete eepeta re). asc sversts 1583 0492 


All of these coefficients are positive, but only in the cases of 
KA-SA and KA-SES are the coefficients more than three times the 
standard errors. It is only in these two cases that we can be 
reasonably sure that a true positive correlation is indicated by 
the obtained correlations. These results, then, are in accord with 
the conclusions just expressed with reference to the effect of kinder- 
garten attendance upon the scholastic achievement of the ten- 
year-olds. 

It might be assumed that the differences between the two groups 
in school attendance and socio-economic status were not sufficient 
to have any significant effect upon school achievement. If we grant 
this assumption, and express the achievement only with reference to 
the average mental ages of the two groups by computing the ac- 
complishment ratios from the average .A.’s and M.A.’s, we obtain 
for the kindergarten group an A.R. of 103.9 and for the non- 
kindergarten group an A.R. of 103.7. 

From the three different methods of treating the data, no evi- 
dence appears that ten-year-old children who have a median kin- 
dergarten attendance of 132.5 days will do any better on the Stan- 
ford Achievement Test than the ten-year-olds who have never at- 
tended the kindergarten. However, these results should not be 
interpreted to mean that kindergarten training is of no importance, 
for such training may adapt the children better to their school life 
in the first grade, and aid the children in acquiring many other 
responses not usually regarded as scholastic in character. 


2. Results of Measurements 


As previously stated, the children varied only 9 months in life 
age, but in spite of this narrow range in life-age they varied con- 
siderably in the other measurements. In Table III are given the 
measures of variability and central tendency for educational age, 
mental age, school attendance, socio-economic status, life-age, and 
grade placement. 


SOME HEREDITARY AND ENVIRONMENTAL FACTORS 49 


TaBLe IIJJ.—ExtTrREME AND TENTH TO NINETIETH PERCENTILE POINTS 


Extreme Difference 10th to 90th Difference 


Points Percentile 
E. A. in months. 90—205 115 121.61—158 .28 36.67 
M. A. in months. . 80—215 135 110.14—158 .52 48 .38 
S. A. in days........] 300—1000 700 541.80—795.61 253.81 
S.2-S.in' 8: D.- 445—545 100 459 .48—536.82 77.34 
L. A. in months....} 121—130 9 121.99—129.13 7.14 
Gobsin gradess.ccc: 2— 6 4 3.55— 5.36 1.81 


The differences in the first column of differences are somewhat 
larger than they should be for E.A., M.A., and S.E.S., because they 
represent the lower and upper limits of the lowest and highest in- 
tervals of the distribution tables. Five points were included in each 
interval. All ranges of these three items are also somewhat too 
large, because the reliability of the measuring instruments was not 
perfect. However, for E.A. and M.A. the reliability of the tests is 
so high as to make the error for these items negligible. 

The distributions, excepting those for L.A. and S.E.S., conform 
fairly well to the requirements of a normal distribution. For L.A. 
and S.E.S. the distributions are more nearly rectangular. The man- 
ner of selecting the children explains this type of distribution for 
L.A. However, on account of the fact that the ages were computed 
only to the nearest month, the first and last intervals of the L.A. 
distribution have only about one-half as many cases as the other 
intervals. The rectangular nature of the distribution for S.E.S. 
may be partly due to the fact that the children were selected only 
from those schools which were located in communities of average 
socio-economic status. As the G.P. was determined for the children 
during the first semester and the tests were given during the second 
semester, the extreme points for G.P. at the time.the tests were 
given would be 214 and 614, if the extreme cases received a promo- 
tion at the end of the first semester. 

Children who were in the lowest division of Grade IT received 
a score of 2, while those in the highest division received a score 
of 214 for G.P. Similar scores were given to children in the other 
grades. 

If the 10 percent of the children who made the lowest scores and 
the 10 percent who made the highest scores are eliminated from the 
computations, the range for the remaining 80 percent is still very 
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large for a group of children whose extreme range in life age is 
not more than 9 months, as may be seen in the last column of the 
table. These remaining 80 percent have an extreme range of 3 
years in H.A., 4 years in M.A., 254 days in S.A., and 2 grades 
in G.P. 

Additional measures of variability, as well as measures of cen- 
tral tendency, are given in Table IIIA. 


TasuE IITA.—Mrasures oF CENTRAL TENDENCY AND VARIABILITY 


Arithmetic Standard Coefficient of Vorickility 

Mean Deviation | of Variability (Woolf) 
E. A. in months....... 139.24 15.06 . 1082 . 1428 
M. A. in months...... 134.05 19.51 -1455 alSos) 
Sh JANd VCE ASICs abides 690.32 104.30 1511 .1688 
Sh DAS Sh IDE o5 das 500.29 20.13 .0554 .0740 
pA pinimonths) ene 125.78 2.607 .0207 .0282 
G. Pein'grades,... 4.0 4.28 .656 21583 olis3 


The average attendance is given only to the end of the first 
semester, 1926. The average G.P. is that which obtained for the 
first semester. Therefore, for the second semester, if practically all 
of the children were promoted, the average G.P. should be 4.78, or 
5 of a grade higher.” As the units of the different measuring in- 
struments which were used in the investigation differed in size, 
the coefficient of variability affords a better basis for comparing 
the variability of the different variables than does the standard 
deviation. These values are given in the third column of Table 
IIIA. Upon the basis of these coefficients, M.A., S.A., and G.P. 
are approximately equal in variability. For E.A., variability is 
somewhat lower. It is still lower for S.E.S., and, of course, con- 
siderably lower for L.A. 

In the last column of Table IIIA, the coefficients of variability 
for Miss Woolf’s investigation are given. Excepting M.A., all of 
her coefficients are somewhat larger, as they should be, because for 
her children the extreme range in life age was 3 months more than 
for the children of this investigation. §8.E.S. is larger because no 
attempt was made to select schools in communities of average social 
status, as was the case in this investigation. In spite of this fact, 
there is practically no diffexence between the two coefficients 
for 8.E.S. 
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In Tables III and IIIA it has been shown that there is a large 
amount of variability in E.A. The important fact to be determined 
in this investigation is the amount of variability in E.A. which may 
be ascribed to each of the factors which have been selected as the 
causes of this variability. If the major portion of the variability in 
H.A. is due to S.E.S. and S.A., then we would have to conclude that 
the environmental forces as measured by these factors were the 
most potent in determining differences in educational results as 
measured by E.A. If, on the other hand, the major portion of the 
variability in E.A. is due to M.A., then we would be justified in 
concluding that heredity, as measured by M.A., was most potent in 
determining differences in education. If, in the third place, a large 
portion of the variability in E.A. remains unaccounted for, it will 
be impossible to determine definitely and conclusively from the 
data of this investigation whether heredity or the environment is 
the main cause of the differences in the scholastic abilities of our 
group of children under the present environmental circumstances 
of school and social life. 

In order to determine which of our several factors contributes 
the most to the variability in E.A., the correlation technique has 
been employed. In Table IV the zero-order coefficients are given 
for the different factors. The factor of grade placement does not 
appear in this table because, as previously pointed out under the 
general plan of the investigation, our technique is inadequate to 
deal with this particular influence. 

As it appeared from an inspection of the correlation tables that 
all correlations were of the linear type, all zero-order coefficients 
were computed by using the usual Pearson formula. 


Tasie [V.—ZnRo-OrpER COEFFICIENTS 


| 
—— | — 


EDA? sfotelere oe .737.—-«.0106 | .447.—- «.0188 | 3856 =. .0205 | .1385 ~— .0280 
VE PAS Neiet stove ete Be eae a .817 =-.0211 | .3859 §=.0204 | .120 = .0231 
SiAA tense Lage pcos Pores ah as .169 .0228] .415 .0194 
op Dist cect ousl POR, wn Weta el eee ae Yee Ne a cot .001  .0217 


The coefficient for L.A. and S.E.S. is insignificant, as it should 
be expected to be. The coefficients for L.A. and H.A. and for L.A. 
and M.A. are only slightly significant. These, as well as the coeffi- 
cient for L.A. and §.A., would have been much higher had the chil- 
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dren not been so selected as to limit the range in life age to nine 
months. 

It might also be assumed that, as an attempt was made to select 
schools in communities of average social status, the coefficients for 
S.E.S. and each of the other variables are too low. This may be the 
case, but Miss Woolf? who included in her investigation all of the 
ten-year-olds of the Greeley Schools, exclusive of those for foreign- 
speaking parentage, obtained even lower correlations for S.E.S. and 
the other variables. I am giving her correlations in the following 
tabulation, excepting the one for S.E.S. and L.A., which she did not 


compute. S.E.S-B.A,  .369 
S.E.S.-M.A.,  .257 
S.E.S.-8.A.,  .056 


The coefficient for E.A. and S.A. is lower than it should be if, 
as some contend, absence from school does not have any significant 
effect until a certain amount has been reached... While the number 
of cases with different amounts of absence is probably too few to 
furnish any very reliable evidence on this point, I am, neverthe- 
less, giving in Table V medians in E.A., M.A., and S.E.S. for dif- 
ferent amounts of school attendance. 


Tasie V.—Meptans 1n Epucationat Aan, Menta AGz, AND Socio-Economic 
Srarus ror DirrFeERENT AMOUNTS OF SCHOOL ATTENDANCE 


Med. E. A. | Med. M. A. | Med.S.E.S. 
Attendance in Days | jn Months | in Months |inS.D. Units | Frequencies 


SOONGUN re eter 151.7 143.3 515.0 38 
Len! MEE em ae Ue "141.8 "134.2 *506.4 98 
740 aie ete 141.7 137.1 505.1 169 
[AU A aS 140.8 "134.8 507.8 119 
Ghee Sires eee A *135.6 132.8 * 502.8 121 
CoA ae nape aps 133.5 134.1 501.6 70 
* * * 
BeUs ve ake ace 129.8 122.9 489.7 81 
115 Oe aL EY 132.5 126.5 498.8 43 
BOER) cis em aie "197.5 193.1 483.8 33 
PAO pe Sonic ts 197.4 124.0 "478.3 40 


* Woolf, Henriette K. Loe. cit., p. 14. 
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The attendance of each of the ten groups (Table V) is less by 40 
days than that of the preceding group, excepting the last group 
which attended 100 days less than the preceding group. Asterisks 
indicate the largest group differences in E.A., M.A., and S.E.S. 
The most marked decrease in E.A. occurs between the groups of 
best and next best attendance. However, in consideration of the 
fact that the attendance range of the best attendance group is twice 
that of the other groups, the decrease should be reduced by one-half. 
Even with this reduction, it is as large as any of the decreases. The 
data do not, therefore, show that a decrease in E.A. is more likely 
to be associated with a decrease in attendance in one than in any 
other part of the entire range of attendance. 

It appears, moreover, from a comparison of the largest differ- 
ences in E.A. with the largest differences in M.A. and S.E.S. as 
indicated by the asterisks, that they occur with surprising regu- 
larity between the same groups. The connections among these 
variables appear to be far more intimate than between any of them 
and school attendance. 

On account of variable or observational errors in measurement, 
all of the coefficients involving E.A., M.A., and 8.E.S. are too low. 
To correct for this attenuation, I used the following formula: 

PaiBi 
V Vasa! Bip 

In this formula rap represents the true correlation between any 
two tests, A and B; Yra,p, represents an obtained correlation be- 
tween any two tests A,B,, and the expressions in the denominator 
represent coefficients of reliability for any two tests. 

The coefficients involving H.A., M.A., and 8.E.S., corrected for 
attenuation, are as follows: 


CAB aw 


1 .7709 TA — SAN eae 3287 Isa-ses — 1812 
A= SAL st 4497 Tua-sng — 0991 

Ypa-seg — .3840 DAS An coe 1244 

IenyNcaby a .1358 


The self-correlations used in making the above computations are :® 
Rea = .988 Rya=.98 Reg=1.00 Rengs=.87 | Ria = 1.00 


’ For the first of these, see Kelley, T. L., Ruch, G. M., and Terman, L. M. 
Stanford Achievement Test. Manual of Directions. World Book Co., 1924; 
for the second, Terman, L. M., and DeVoss, J. C. ‘‘The educational achieve- 
ments of gifted children.’? The Twenty-Third Yearbook of this Society, 
Part I, 1924. 
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As the subjects of this investigation ranged in age only 9 
months, and were located in schools selected upon the basis of aver- 
age social standing, the reliability coefficients for H.A., M.A., and 
S.E.S. might have been increased somewhat if they had been cor- 
rected for range, but as this was not possible for S.E.S. (on account 
of the absence of the necessary S.D.), the correction was not made 
for any of them. 

On account of the narrow range of L.A., which has the effect of 
reducing the sizes of the coefficients between it and the other vari- 
ables, it would give a false picture of the relative influence of the 
different independent variables upon variability in E.A. if its in- 
fluence were included along with the influences of the other vari- 
ables. Therefore, the influence of L.A. was rendered constant by 
partialling it out from the coefficients as corrected for attenuation. 
The corrected coefficients were modified by this procedure and are 
set forth in the following tabulation: 


CHASMA .7670 Pira-cA .8070 Tsa-szes — 1987 
IRB REGIS 43863 TMa-ses — 4021 
Tpa-ses — 3875 


These are the coefficients which were used to determine the rela- 
tive importance of M.A., S.A., and S.E.S. in producing variability 
in E.A., both by means of the regression equation and the method 
of path coefficients. However, neither of these methods estab- 
lishes a causal relationship between the dependent variable on the 
one hand and the independent variables on the other. Such a rela- 
tionship must be assumed, or shown in other ways. These methods 
simply show how much variabiliiy in E.A. is associated with a unit’s 
variability in each of the independent variables. If reason leads 
us to accept a causal connection, tren the relative influence of M.A., 
S.A., and S.E.S. upon I.A. follows. 

The special form of the regression equation with §.D. for each of 
the variables equal to 1 was employed. Here are both the formula 
and the equation. 

D.C res bga, MA, SA, sesXma 1 bya, SA. MA, SESXSA aL bza, SES. MA, SAXSES 
Xen = 6711 Xyaq + .2155 Xs, + .0751 Xgug 

As a check on the accuracy of these computations, the coefficient 
of multiple correlation was computed by two different formulae. 
The following multiple coefficients were obtained: 

Rsga (MA, 84, SES) = -7985 
Rga (ma, 8a, SEs) = -7982 
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If, as stated before, a causal relationship is assumed between 
E.A. and each of the independent variables, then a change of 1 unit 
in M.A. will produce a change or variation of 24 of a unit in E.A.; 
a variation of one unit in S.A. will produce a variation of only 4% 
of a unit in E.A.; and a variation of 1 unit in §.E.S. will produce 
a change of less than 4, of a unit in E.A. 

In order to bring my results into as intimate relation as possible 
with those found by Miss Woolf,* I computed a regression equation 
for these four variables without correcting for attenuation and 
without rendering life age constant. These computations were not 
made by Miss Woolf. The same formula was employed for each of 
the equations, but the computations were based upon the coefficients 
found in Table IV. I again checked my computations by means 
of the method of multiple correlations, obtaining .7767 by one 
formula and .7772 by the other. 


CoMPARABLE RESULTS OF OTHER INVESTIGATIONS 
The comparable equations obtained by Miss Woolf and by me 
are as follows: 
Woolf... ..... Xe, = 6285 Ky, + 1734 Xp, + 1989 Xu 
Heilman... ...Xu, = 6366 Xu, + .2311 Xg, + 0988 Xu 


The correspondence between these equations appears to me to be 
surprisingly close when it is remembered that Miss Woolf had only 
157 cases with wider variations in L.A. and S.E.S., and that her 
investigation was made in a small town of about 12,000 inhabitants, 
while the data for this investigation were obtained in a city of about 
300,000 inhabitants. Unfortunately, Miss Woolf does not give any 
check on the accuracy of her computations. 

Dr. Whitney and Mr. Woody express their results in terms of the 
regression equation. These results are subject to the limitations of the 
partial correlation technique which have been mentioned. Whitney and 
Woody’s investigation differed materially from this one in the number 
of children and their age, the method of selecting them, the nature of 
the community, and the number of variables. Moreover, different tests 
were used for the purpose of measuring intelligence and socio-economic 
status. It should be noted especially that the children differed widely 
in life age, as an unselected group of ninth-grade children would. The 
variables with their coefficients and the squares of the coefficients are 
given in the following tabulation: 


* Woolf, Henriette K., Loc. cit., p. 14. 


ws s 


THE TWENTY-SEVENTH YEARBOOK 
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Regression Coefficients | Squares of Coeificients 


Giris 

- Mental age... .... wets < .50388 19: .2388 
Pale BHO So. a ee ek — 2572 — 24 .0662 0637 
Socio-economie status ..| .1069 | = .Q0S0 O14 0001 

4. School traiming. .... 2... — .0752 — 142 . 0057 1s 
5. Distance from school...) — .Q457 .0155 21 .0002 
G- Phyndque sa os ; —.0821 — .0S810 . 0008 . 0066 
7. Permanence of residence 40 | = 0786 .6c00 . 0062 


The approximate percentage contribution of each of the causal 
factors to the effect, in this case educational age, may be read from the 
last two columns of the tabulation. There are only two significant causal 
factors, mental age and life age. However, all of the figures are un 
doubtedly too low on account of the technique employed. As the co- 
efficient for life age is negative, age depresses achievement, as was to 
be expected for a group of ninth-grade children, the older and duller 
children being classed with many younger and brighter children. Mental 
age is, therefor, the only positive coefficient whose contribution to varia- 
tion in educational age is of any significance. 


Use oF THE Merson oF Pats COESFFICIENTS 


At the suggestion of Barbara Burks, and with her assistanee, 
I applied to my data the method of path coefiicients deseribed by 
Sewell Wright” This method has several advantages over the ardi- 
nary correlation method. In a complex system of correlated varia- 
bles, the coefficient of correlation gives the total amount of relation- 
ship between two variables due to all of the paths by which they 
are connected, combined, while the path coefficient measures the 
direct influence of each variable upon any other with whieh it is 
eonnected, independently of the effect of other variables For ex- 
ample, in our problem M.A. has a direct effeet upen E.A., but it 
also has indirect effects upon E.A. through S.E.S. and S.A. The 
coefficient of correlation represents the combined direct and indireet 
effect of M.A. upon E.A., but the path coefficients give the direct 
effect and all of the indirect effects separately. 

However, whether one variable has a causal effect upon an- 
other must, by this method as well as by all correlation methods, 
be assumed or determined in other ways. 

The symbol for the path coefficient is Px. in which X is the 
dependent variable or the effect, assuming a causal relationship, . 


“ Wright, Sewell. ‘‘Correlation and causation?’ Jour. of Agricultural 
Besearok, 20: No. 7, 1921. 
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and A is the independent variable, or cause. The path coefficient 
is defined as the ratio of the standard deviation of X due to A to 
the total standard deviation of X. It is, therefore, the coefficient 
for the path of influence from A to X. 

Another coefficient discussed by Wright 7° is the ‘‘coefficient of 
determination,’’ represented by the symbol dx. which is equal to 
P?x 4. This measures the percent of influence which A has upon X 
in a given system of correlated variables. If all of the causes are 
accounted for, the sum of the coefficients of determination equals 1. 

In Fig. I the variables of our problem, with their path coeffi- 
cients, are represented.*4 


Fig. I. VARIABLES AND PATH COEFFICIENTS 


In this diagram: 
EA = educational age 
MA = mental age 
SA = school attendance 
SES = socio-economic status 
X = complex of all factors, other than MA, SA, and SES, determin- 
ing EA 
Y = complex of all factors, other than MA (or more accurately, other 
than that portion of heredity which correlates perfectly with 
MA), determining SES 
Z = complex of all factors, other than MA and SES, determining SA 


u This diagram, with the equations for computing the path coefiicients, 
was furnished by Miss Burks. 
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The lines in the diagram represent the path coefficients. They 
represent not only the amount of direct influence of one variable 
upon another, but also the direction of the influence. These paths 
must be set up upon the basis of deductions regarding the influence 
of one factor upon another. As has been said, we have no sure 
knowledge of whether the directions of influence are actually as 
they have been represented, whether some of these directions 
may be reversed, or whether reciprocal influences may not ac- 
count for some of the relationships. The directions of paths 
which are here postulated seem to me the most reasonable of 
possible ones on a priori grounds. However, the correlations be- 
tween 8.E.S., M.A., and S.A. might be due to an entirely different 
system of influences than that postulated. But even if this should 
be the case, the computed path coefficient of each with H.A., upon 
which the chief conclusions of this study are based, would remain 
unchanged. In the event of reciprodal action between E.A. and 
any of the three variables S.E.S., M.A., or 8.A., the path coefficients 
would indicate an upper limit which the true causative effect of 
these variables probably would not exceed. The paths are labeled 
with small letters. What each of them represents may be gathered 
from the following eqilations: 


Prema —d Psama = 1h Psrgma = ¢ 

Pra se =’ 2 Psa ses = € Psesy = bd 
EASES — @ Pag = 4 

Pux =f 


Because the coefficient of correlation between any two variables 
is equal to the sum of the products of the paths in each series of 
paths with which they are connected, plus the path of direct con- 
nection, it is possible upon this basis, if the coefficients of correla- 
tion are known, to set up enough simultaneous equations to deter- 
mine the value of each path represented by the letters. This will 
give the following equations: 

Tra-ma — ac + gh+d-+ceg T'ma-ses = @ 

Tpa-ses — a + eg + cgh+ ed Tua-sa —h-+ce 

Ypa-sa — g+ae+ach+dh-+ ede Igns-sa =e + ch 

In these equations all of the coefficients of correlation as well 
as the value of ¢ are known. From these data the value of each 
path coefficient can be determined. They are as follows: 


a — .0750 e = .0898 = ).9106 
d = .6706 e = .4021 i = .9481 
g = .2155 h = .2709 f = .6022 
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By a comparison of the path coefficients a, d, and g with the co- 
efficients of the regression equation computed with the same zero 
order coefficients, I found that they were equivalent. Wright’s 
path coefficient is, in this situation, therefore, the same as the co- 
efficient of the regression equation. However, his coefficient of 
determination is equal to the square of the path coefficients or the 
square of the coefficients of the regression equation. Now, as 
Wright” has shown that, where the relation is causal, the coeffi- 
cient of determination measures the percent of influence which any 
one of a number of causes has upon their combined effect, or the 
percent of influence which each independent variable has upon the 
variability of the dependent variable, it follows that the squares of 
the regression coefficients measure the percent of influence of each 
independent variable upon the dependent variable. It also follows 
that the relative weights of the regression coefficients for deter- 
mining relative influence are not the ratios of these coefficients, but 
the ratios of their squares. 

Wright**® says: ‘‘It can be shown that the squares of the path 
coefficients measure the degree of determination by each cause. If 
the causes are independent of each other, the sum of the squared 
path coefficients is unity. If the causes are correlated, terms repre- 
senting joint determination must be recognized. . . . The squared 
path coefficients and the expressions for joint determination meas- 
ure the portion of the squared standard deviation of the effect due 
to the causes singly and conjointly, respectively.’’ 

Therefore, in order to determine the fractional part which each 
of our independent variables contributes to their combined effect 
or to the amount of variation of the dependent variable, it is neces- 
sary to compute the coefficients of determination. 

The fundamental equations for these coefficients are: 

dxa= Pex, 
dx ap = 2Px.aPx slap 


Sdx.a Ae Sdn =I] 


“Wright, Sewall. Loc. cit., page 31. 


Wright, Sewall. ‘‘Systems of mating. I. The biometric relations be- 
tween parents and offspring.’’ Genetics, Vol. 6, 1921. 
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For our problem, these equations may be written: 


1. dszrgma = ¢? =) 1617 
2. dszs.y = b? = :8383 

1.0000 
é:dga sug = e? = .0081 
4. dgama = h? =e (G34 
5. dga.seSMA — 2ceh = .0196 
6. dgaz = i? = .8989 

1.0000 
7. dzasrs = a? = 0056 
8. drama = d? = 4497 
9. drasa = g? = .0464 


10. dea gases = 2age+2agech = .0064 
11. deasama = 2dgh+2dgce = .0887 


12. dza.sesSMA = 2ade = .0405 
13. deax — f? = 3627 
1.000 


I obtained a check on the accuracy of the computations involved 
in the second group of equations above by letting i? equal 
1-R?ga(ma, ses). The value of i? is given in Equation 6 as .8989, 
and the check value is .8990. Similarly, a check on the accuracy 
of the computations involved in the third group of equations was 
obtained by letting f? equal 1-R?zacma, sa, szs)- Equation 13 gives 
the value of f?, which is .3627. The check value is .3624, if R is 
taken as .7985, and .8629 if it is taken as .7982. These different 
values for R were obtained by means of two methods of com- 
puting R. 

Although, in this investigation, we are not concerned with the 
relative importance of the factors contributing to variability in 
S.E.S. and 8.A., it may, nevertheless, be pointed out that only 16 
percent of the variation in S.E.S. is determined by the part of 
heredity measured by the Binet Scale, while 84 percent is deter- 
mined by other factors. Similarly, not more than 7 percent of the 
variation in school attendance is determined by M.A., less than 
1 percent by S.E.S. and about 2 percent by S.E.S. and M.A. com- 
bined; the remainder, 90 percent, is determined by other factors. 

We are chiefly concerned with the third group of equations 
which gives the percent of variation in E.A. determined by each 
of the independent variables separately and in different combina- 
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tions. Only about one-half of 1 percent of the variation in B.A. 
is determined by S.E.S. directly. Less than 5 percent of the varia- 
tion in E.A. is determined by S.A., while 45 percent of it is deter- 
mined by M.A., and 36 percent by other factors. These percentages 
give the direct influence of the different variables in determining 
variation in E.A. These variables also have an indirect or joint 
influence upon variation in E.A. which cannot be scparated. The 
joint influence of S.A. and S8.E.S. is only a little more than one- 
half of one percent, while that of M.A. and S.A. is about 9 percent, 
and of M.A. and S.E.S. about 4 percent. If the indirect, or joint, 
influence of M.A. upon variation in E.A. could be separated, it is 
not unlikely that it would account for at least 50 percent of the 
variation in E.A. Moreover, what the other influences representing 
36 percent are, we do not know. They may be evenly divided among 
the unaccounted for or unmeasured environmental and hereditary 
factors. If this assumption is made, then at least two-thirds of the. 
variation in H.A. is determined by the hereditary factor. 

If the relative weight of our causal factors is based only upon 
their direct influence in contributing to variation in E.A., then 
M.A. has 10 times as much influence as S.A., 80 times as much as 
S.E.S. and 9 times as much as 8.A. and 8.E.S. combined. Differ- 
ences in heredity, then, are 9 times as potent in producing differ- 
ences in education as are our measured differences in environment. 
By differences in heredity in this sentence, is meant such as are 
measured by the Stanford Revision of the Binet Seale; by differ- 
ences in education, such as are measured by the Stanford Achieve- 
ment Test; and by differences in the environment, such differences 
in socio-economic status as are measured by a revised Chapman- 
Sims Scale, and such differences in training as are represented by 
differences in school attendance. Moreover, by the word ‘‘differ- 
ences’’ is meant only such as obtained in the schools and commu- 
nities involved in the investigation. 

It should also be pointed out that these results are no index 
of the efficiency of the schools and of the home in educating chil- 
dren. They do not show how far the children have been lifted 
above a zero educational level. They do purport to show under 
existing educational and socio-economic conditions which of the 
causal factors is most potent in explaining the enormous differences 
which are so prevalent in educational achievement. 
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For the data on E.A., M.A., S.A., and S.E.S. obtained for the 
ninth-grade girls, Whitney and Woody employed the method of 
path coefficients. They treated these results in exactly the same 
way as the results of this investigation were treated, with the ex- 
ception of correcting for attenuation. I have tabulated here their 
coefficients of determination, and for the sake of comparison have 
repeated the coefficients of this study. 


Coefficients of Determination 
Whitney-Woody Heilman 


Pe“ GnG wan. ese citer .0013 1617 
On Csi Gavan ate 9987 .8386 
1.0000 1.0000 

SIEM aR Se sel. teen .0023 .0081 
cS BT's Oe yank hewn Sn .0022 .0734 
DC ACS OCS Renee —.0001 .0196 
Godda! Mee teste aa 9956 8989 
1.0000 1.0000 

Ms Ca eballs settee y .0014 .0056 
S: Bll macie acne hie ee ney 4497 
Oe aera ae | paket, Salalah .0154 .0464 
LO Gai enee. bs-occs 2 —.0004 .0064 
Land face kee oe —.0056 .0887 
12, OWA RROMA.. ch eniln: —.0014 .0405 
iS ants Pte eerie ean 1374 .3627 
1.0000 1.0000 


The most striking difference between these two columns of co- 
efficients is that, for the known variables, those in the first are very 
much lower than those in the second column. This disparity is in 
part due to differences in the amount of variation of corresponding 
variables of the two investigations, and in part to the fact that the 
Whitney-Woody coefficients are based upon raw correlations, 
whereas my coefficients are based upon corrected correlations. For 
this investigation the coefficient of variability for S.A. is 1.6 times 
the S.A. coefficient for the Whitney-Woody study. The coefficients 
for the other variables show negligible differences, amounting to 
about one-tenth of the size of the higher of each pair of coefficients. 
It is, moreover, significant that in the Whitney-Woody study at- 
tendance was reported only in full years; smaller differences in 
attendance were not recognized. It appears, therefore, that a child 
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had to attend school only for a month or two in any year to be 
credited with one year’s attendance. 

I applied Thomson and Pintner’s criterion for the unselected 
nature of a group to Whitney and Woody’s data. The coefficient 
of correlation for life age and mental age was found to be —.160 
and the ratio of the coefficients of variation of life age and mental 
age .461. Corresponding figures for this investigation are .120 
and .142. The requirement is approximate equality of the co- 
efficient and ratio. With these and other rather marked differ- 
ences between the two investigations, any close correspondence be- 
tween the two sets of path coefficients should not be expected. 

In one respect, however, the two investigations do show a strong 
tendency in the same direction. Both of them show that mental 
age has a marked preponderance over all of the other causal fac- 
tors which appear in the investigations. In fact, this preponder- 
ance is relatively much greater for the Whitney-Woody investiga- 
tion than for this investigation. 

Our schools and higher institutions of learning could be very 
much improved if they were more concerned with the formation 
of classes upon the basis of equal native endowment than they are 
with making classifications on the basis of equal training. Cur- 
ricula are made and administered without any, or very little, refer- 
ence to individual differences in native talents. I am of the opinion 
that under existing conditions there is the greatest need for an 
efficient personnel department in our schools and colleges for the 
purpose of steering the children and students educationally and 
vocationally. Such, at least, are the conclusions I have arrived 
at both from the results of this investigation and from my experi- 
ences as a teacher and student in the public schools and in higher 
institutions of learning. 


A SuMMARY OF THE MAIN RESULT, WITH SOME OF THE 
PracticaL IMPLICATIONS 


Educational achievement was measured by the Stanford 
Achievement Test, and was expressed in terms of educational age. 
In reality, therefore, we determined which of the causal factors, the 
hereditary or the environmental, were the most potent in produc- 
ing educational age. The hereditary factors are represented by 
mental age as measured by the Binet Scale; the environmental fac- 
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tors are represented by school training as measured by school at- 
tendanee, and by the socio-economic status of the home as measured 
by a revision of the Chapman-Sims Socio-Economie Seale. 

We found that ten-year-old children differed widely in educa- 
tional achievement as expressed by the amount of variation in edu- 
cational age. We also found that these children differed widely in 
their standing in the causal factors, differences which are expressed 
in terms of variation. With these facts in hand it became possible 
to determine which of the causal factors made the largest contribu- 
tion to the differences in educational achievement. 

Our results show that probably 50 percent of the variation in 
educational age was due to such hereditary factors as had been 
measured; less than 13 percent of the variation, to school train- 
ing ;** not over one percent, to the socio-economic status of the home; 
and about 36 percent, to such hereditary and environmental factors 
as had not been accounted for. In schools and communities which 
differ from the schools and communities from which our data were 
drawn, the relative significance of these figures would have to be 
modified. But, as by a similar procedure, almost equivalent re- 
sults were obtained for all of the ten-year-old children of Greeley, 
there is some evidence for the belief that they would have to be 
modified only slightly for different school communities of the 
Northern states. 

I am of the opinion that the results obtained have very signifi- 
eant implications for educational practice in the public: schools 
and in the higher institutions of learning. Our results show that 
under the prevailing home conditions and school organization, in- 
tellectual endowment has by far the most powerful influence in de- 
termining differences in achievement in the traditional curriculum. 
It is not unlikely that a similar statement could be made for achieve- 
ment in general. 

In view of the superior potency of intellectual endowment, it 
is far more important to make classifications in the public schools 
and in higher institutions of learning upon the basis of intelli- 


“Thirteen percent represents the sum of the effects of S.A. alone, plus 
S.A. in combination with M.A. It is of interest here to refer to the correlation 
(corrected for attenuation) between H.A. and S.A., which was .44. The square 
of this value, or 19 perecnt, represents the maximal contribution which S.A. 
conceivably makes to variability in E.A., including its connection with E.A. 
through indirect paths of influence, and regardless of the philosophical assump- 
tions underlying the use of the path coefficient technique. 
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gence than upon differences in the amount of training. This un- 
tenable custom of classifying upon the basis of the amount of at- 
tendance is far more prevalent and more rigidly adhered to in the 
colleges and universities than in the public schools. It cannot be 
justified either by the results of investigations or by teaching ex- 
periences. Even though the brightest students devote only one- 
fifth as much time to their studies as the dullest, they still make the 
higher grades. Thus, for example, bright students who have had 
no courses in psychology do far better work in that subject than do 
dull students who have had from two to six courses in the subject. 
Moreover, in the public schools there are bright children who, in 
spite of inferior application, complete the work of the elementary 
school in one-third of the time required by the dull. 

A second important inference is that the activities concerned 
with the educational and vocational guidance of children in the 
publie schools and of students in college should receive more recog- 
nition. There should be departments in the school with the func- 
tions of classifying children, adapting school tasks and methods of 
instruction to their learning abilities, and of determining the 
probable occupational levels at which they could succeed in life. 

Efficient personnel departments should also be established in all 
the higher institutions of learning, charged with the functions of 
determining what students should be allowed to enter, of making 
the best classifications under existing practical limitations, and of 
guiding students into the courses and departments most nearly 
adapted to their interests and intellectual endowments. Such a 
department would eventually render more real service to the future 
citizens of our country than any other department in the institu- 
tion. As heredity is so much more significant than training in de- 
termining differences in educational standing, there should be more 
effort devoted to guidance, even if less effort must then be devoted 
to training. 

This investigation does not determine how much the children 
learn in our schools. It does not show how much the schools have 
contributed in raising the children to their present levels of attain- 
ment. But it does offer some evidence in favor of the contention 
that in human nature there are marked differences in the possibili- 
ties of training which must be respected and provided for in every 
well-regulated school system and higher institution of learning. 


PREFATORY NOTE TO CHAPTER III 


Miss Denworth, gathering her material in New York, furnishes 
data which concur very well with those gathered by Heilman in 
Denver. Although this study and the one by Heilman differ from 
one another in the age range of subjects, and in the fact that Heil- 
man’s study contains data upon the socio-economic status of his 
subjects while this one does not, both studies agree as to essential 
conclusions, 1.e., in showing a significant relationship between in- 
telligence and school achievement, and in showing only a very 
slight dependency of deviations in achievement upon variability 
in school attendance. 

Strictly speaking, it should be added that the question whether 
or not the relationship established between achievement and intelli- 
gence directly expresses the causal dependency of school accomp- 
lishment upon native capacity is beyond the scope of either the 
Denworth or the Heilman study to answer. Logically, it is possible 
that the two factors might have a reciprocal effect upon each other, 
or both might be affected by some common environmental influence, 
as far as either study alone could conclude on the basis of its own 
data. 


CHAPTER ITT 


THE EFFECT OF LENGTH OF SCHOOL ATTENDANCE 
UPON MENTAL AND EDUCATIONAL AGES* 


was KATHARINE M. DENWORTH 
Principal, Bradford Academy, Bradford, Massachusetts 


I. THe PurPosE AND THE NATURE OF THE INVESTIGATION 
1. The Importance of the Problem 


Ever since schools have existed, it has been tacitly assumed 
that if only the school and the child could be brought together and 
kept together long enough, education would result. 

Recently, largely as a result of the great individual differences 
in the mental and the educational status of children of the same 
age revealed by testing intelligence and achievement, doubt has 
arisen as to the effectiveness of school instruction in determining 
mental capacity and educational acquirement. This doubt, per- 
sistent and growing, has given rise to numerous attempts, of which 
the present study is one, to determine to what extent mental and 
educational development are determined by the efforts of the school 
and to what extent by mental endowment. Such information is 
needed to eliminate futile effort and to insure maximal returns on 
the pupil’s, the teacher’s, and the public’s investment in education. 

The best measures of the efficacy of school instruction are the 
mental and the educational growth which accompany time spent 
in school. If attendance at school is highly productive of such 
growth, present school practice may be confidently continued; if 
unproductive of at least considerable educational development, 
present practice should be superseded by productive procedure. 


* This study, undertaken while the author was at Columbia University, 
has benefited by the encouragement and helpful criticism of Professors 
Percival M. Symonds, William A. McCall, Rudolf Pintner, William C. Bagley, 
Thomas H. Briggs, and Bird T. Baldwin (formerly of Swarthmore College). 

It owes much to the long-continued codperation of Dr. Abby Porter Leland, 
Principal, and Miss Mildred Innes, Acting Vice-Principal, of Public School 
157, Manhattan. Codperation was also extended in carrying out the testing 
of intelligence by Professor Pintner, his assistants, Miss Carrie Coy and Dr. 
Gertrude Hildreth, and his graduate students. The scoring of the achieve- 
ment tests was accomplished through the codperation of Miss Della Wood. 
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This investigation, therefore, attempts to determine the effect of 
time spent in school upon mental capacity and educational ac- 
quirement and the influence of mental capacity upon educational 
acquirement. These effects, however, must be studied in relation 
to other mental and educational influences. 


2. The Delimitation of the Investigation 


Of the possible determinants of mentality, this problem deals 
directly only with attendance at school and age; with the potency 
of native intelligence it deals only by inference. Of the possible 
contributors to educational development, the study deals only with 
intelligence, attendance at school, and age. These factors were 
selected for study because they are not only important, but are 
also objective and measurable, and thus lend themselves to scien- 
tifie investigation. 

In as much as group intelligence tests measure many of the 
same abilities as achievement tests, overlapping of tests was avoided 
by using as the measure of intelligence the complete form of the 
Stanford Revision.of the Binet-Simon Intelligence Scale—the most 
valid, reliable, and carefully standardized intelligence test. 

The Stanford Achievement Test, a battery of tests covering all 
the curriculum from the second to the eighth grade, inclusive, was 
selected as the best measure of all-round educational development. 
The derivation of these tests and the procedure in administering 
and scoring them are described in the Manual of Directions.1 This 
test gives a valid and reliable measure of the educational attain- 
ment, not only of the group as a whole, but also of each pupil. The 
reliability coefficient of the entire battery of tests for unselected 
pupils of constant age is .98. The probable error of the educa- 
tional ages, based on the whole test for unselected pupils of ages 
eight to fourteen, is very low, ranging’ from 1.5 to 2.1 months.? 

The total number of days that a child had attended school from 
his initial enrollment was taken as the most reliable measure of 
the time he had spent in school. 


18tanford Achievement Test, Manual of Directions. New York: World 
Book Company, 1924. 


2Ibid., p. 16. 
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Within the limits prescribed by the foregoing selection of 
factors for study and their measures, this investigation endeavors 
to find tentative answers to the following questions: 

To what extent is variation in scores on the Stanford Revision 
of the Binet-Simon Intelligence Scale determined by variation in 
length of school attendance? By variation in chronological age? 

What influence is exerted upon differences in Stanford Achieve- 
ment Test scores by differences in length of school attendance? By 
differences in mental age? By differences in chronological age? 

What is the prognostic value of a combination of the Stanford 
Revision of the Binet-Simon Intelligence Scale, length of school 
attendance, and age in predicting school success as measured by 
the Stanford Achievement Test? 


38. The Method of Investigation 


The ideal way to determine the influence of attendance at school 
upon intelligence and educational achievement is to use the equiva- 
lent-groups experimental method. 

From a large number of children who had never had a day 
of formal instruction at school or at home, two groups would be 
equated on the bases of chronological age and of intelligence. Then 
the experimental group would be sent to school regularly; the 
control group would be given no formal instruction. At the end 
of a year or more both groups would be given identical tests of 
intelligence and of educational achievement. Differences between 
the mental and the educational scores of the experimental and of 
the control groups could then unquestionably be attributed to the 
effect of school attendance. 

However desirable theoretically, under conditions typical of 
modern American life such an experiment is obviously impossible. 
The influences of nature and nurture, which under the relatively 
constant conditions of public school instruction nevertheless pro- 
duce radical mental and educational differences in children of the 
same age, must be studied within more narrow limits. The more 
accurate equivalent-groups experiment must yield to the more 
practicable causal investigation. 

This causal investigation, consequently, does not treat attend- 
ance as either present or absent, but treats of varying lengths of 
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attendance and of the accompanying variations in mental and in 
educational development. 

Owing to the difficulty of selecting two groups exactly equiva- 
lent in educational and chronological ages and two other groups 
exactly equivalent in mental and chronological ages, the statistical 
techniques of correlation, partial correlation, and regression equa- 
tions were employed as satisfactory substitutes. 


II. THe Sources oF THE DATA AND THE PROCEDURE 
IN COLLECTING THEM 


1. The Subjects 


The investigation was made during the spring terms of 1925 
and 1926 in Public School 157, Manhattan, New York City, an 
elementary school with an enrollment of approximately two thou- 
sand pupils. 

To avoid the influence of accidental selection, since all eight- 
year-olds have not yet entered school and many thirteen-year-olds 
have dropped out, only pupils of nine, ten, eleven, and twelve years 
of age were used in the final results. All the children of these ages, 
in whatever grade enrolled, were tested only after further selec- 
tion on two bases. To insure reliable attendance data, pupils were 
eliminated who had not had all their schooling in the public schools 
of New York City. To obviate the influence of a language handi- 
cap on test scores, children from non-English-speaking homes were 
also excluded. In the final handling of results pupils were 
omitted for whom any of the data were missing on account of 
temporary absence or transfer to other schools. 

In the spring of 1925 complete data were secured for 569 pupils. 
To guarantee greater reliability of the results by increasing the 
number of cases, data were obtained the following spring for all 
the new pupils in the school who qualified on the aforesaid bases 
of age, attendance, and language, and for former pupils who had 
been too young to qualify the previous year. Complete records 
were thus procured for 260 additional pupils, making a total of 
829 cases. 

Included in this number are 52 average and brighter eight-year- 
olds and 77 average and duller thirteen-year-olds, whose inclusion, 
however, does not materially alter results, as the influences of the 
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brighter younger and of the duller older children presumably tend 
to neutralize each other. Nevertheless, to insure the greatest possi- 
ble reliability, the final conclusions are based upon only the net 
total of 700 comparatively unselected subjects, that is, the pupils 
of ages nine to twelve, inclusive. 

The children selected for study have had similar incentives and 
opportunities to learn. Their social and economic status is rather 
below the average. Moreover, they have attended only public ele- 
mentary schools of New York City. They are, therefore, essentially 
homogeneous with regard to the quality of their school instruction. 
Their attendance records, however, prove that they are heterogene- 
ous with regard to the quantity of school training. These condi- 
tions increase the validity of the study. 


2. The Materials and the Procedure 


a. The Measurement of Intelligence——In March, April, and 
May of both 1925 and 1926 the complete form of the Stanford 
Revision of the Binet-Simon Intelligence Scale was applied by 
graduate students of Columbia University who had been trained 
in intelligence testing by Professor Rudolf Pintner and who had 
had some experience in testing. 

In order not to invalidate results by spurious selection, forty 
twelve-year-olds who were not tested in 1925 owing to lack of time 
were tested by the same experienced examiners the following year 
through the courtesy of Mr. Viertel and Miss Kornman, the prin- 
cipals, respectively, of Public Schools 48 and 186, Manhattan, to 
which the pupils had been transferred. Harly in June, 1926, after 
the graduate students had discontinued their work, the remaining 
pupils necessary to complete the age-groups for the second year 
were tested by Miss Dorothy Wright, a skilled examiner, who gen- 
erously gave her time. 

To render mental and educational data comparable, each child’s 
mental age was computed forward or backward to the date of his 
achievement test. 


b. The Measurement of Educational Acquirement.—The Stan- 
ford Achievement Test, Advanced Examination, Form A, was used 
above the third grade and the Primary Examination, Form A, in 
the second and third grades. The tests were given within two days 
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of either April 8 or April 30 in 1925 and within two days of April 
22 in 1926. 

To secure uniformity in administering and scoring the tests 
and thus to increase the reliability of the results, the writer, who 
had had considerable experience in giving and scoring standardized 
tests, administered all the tests and scored 75 papers. All the 
other papers were scored and re-checked by an assistant skilled in 
scoring. Accuracy and consistency were thus assured. 

To render the measures of educational attainment comparable 
with mental and chronological ages, the original total scores yielded 
by the Stanford Achievement Test were converted into terms of 
educational age by means of tables in the Manual of Directions for 
the test. 


c. The Measurement of Length of Attendance.—Since the pub- 
lie schools of New York City have for years kept individual cumu- 
lative attendance records, it was possible to procure reliable attend- 
ance data. 

Only cards were used which unquestionably gave a complete 
record of a pupil’s attendance from his first enrollment in school 
to the beginning of the semester in which the tests were given. 
The home-room teacher’s daily attendance record supplied the 
number of days attended from that date to the day of the achieve- 
ment test. All these data were compiled by the writer with careful 
checking of each step. 

The investigation takes account only of the actual number of 
days of attendance, disregarding the relative potency of attend- 
ance at different chronological ages. For a given pupil one day’s 
attendance at six years of age, for example, will presumably not 
yield the same mental and educational returns as one day’s attend- 
ance at twelve years. Obviously, means are needed for equating 
separately the mental and the educational effectiveness of days of 
attendance at each chronological age. 

To equate days of attendance scientifically for mental and for educa- 
tional growth at different chronological ages, it would be necessary to give 
the Stanford-Binet and the Stanford Achievement tests to many children of 
each age from five to fourteen years. Then, by compiling attendance data 
for each age-group and checking them against the data of the tests, it would 


be possible eventually to determine the best mental and educational weights for 
attendance at each age. The labor involved, however, in the preparatory 
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steps of giving the Stanford-Binet and the Stanford Achievement tests to 
many children too young to be used subsequently in the study would render 
the investigation too onerous to be undertaken. 

Since it is better to obtain even a rough measure of the effect of attend- 
ance upon mental and educational development than to permit too meticulous 
a regard for perfection to dclay information indefinitely, the use of the actual 
number of days of attendance, unequated for age differences, seems to be 
justified as the best empirical measure available. It is believed, moreover, 
that the handling of attendance data separately for each age-group, the tend- 
ency of attendance laws to promote school entrance at nearly the same age 
for all children, and the effect of chance in balancing inequalities in large 
groups—it is believed that these considerations eliminate gross inaccuracies 
in dealing with unequated days of attendance. 


d. The Measurement of Chronological Age.—In every case the 
child’s chronological age was reckoned from the date of birth 
(recorded on his cumulative attendance card) to the date of his 
educational test. 

The subjects having a chronological age from 9 years, 0 days, 
to 9 years, 11 months, and 31 days, were included in the nine-year- 
old group, and similarly for the other ages. 


Til. THe DistrRiwution oF THE MEASURES 


The measurements of all the age-groups, of the 700 unselected 
eases, and of the entire group of 829 subjects, including the se- 
lected eight- and thirteen-year-olds, are summarized in Table I. 
This table gives the number of cases involved and the means (M), 
the standard errors of the means (SD of M), the ranges, the stand- 
ard deviations of the distributions (SD), and the coefficients of 
variation (V), of all the distributions. 

The eight- and thirteen-year-old groups are included in the 
tables, but not in the discussion of the results. 


Tasie I.—Summary oF M®ASUREMENTS 


M SD of M_ Range SD V 
Age 8 (52 Cases) 
Chronological Age ene | i 00 12 2.47 
Mental Age (Months)...... 1.67 98 20.62 
Educational Age (Months). . or 54 60 12.40 


Attendance (Days) Te 505.80 499 118.34 
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TABLE I—Continued 
SUMMARY OF MEASUREMENTS 


M SDofM_ Range SD 
Age 9 (186 Cases) 
Chronological Age (Months) | 114.19 .26 12 23/583 
Mental Age (Months)...... 118.45 1.15 92 15.64 
Educational Age (Months)..| 126.62 1.07 76 14.55 
Attendance (Days) ........ 640.90 10.17 744.5 138.63 
Age 10 (222 Cases) 
Chronological Age (Months) | 126.72 24 Dahl ee 5G 
Mental Age (Months)...... 127.0 1.54 120 22.96 
Educational Age (Months)..}| 136.40 1.30 105 19.35 
Attendance (Days) ........ 779.70 10.53 800 156.88 
Age 11 (166 Cases) 
Chronological Age (Months) | 138.63 28 we, 3.67 
Mental Age (Months)...... 139.12 1.69 124 21.72 
Educational Age (Months)..| 150.26 1.49 99 19.26 
Attendance (Days)........ 964.50 | 12.21 | 911 | 157.26 
Age 12 (122 Cases) 
Chronological Age (Months) | 150.57 382 12 3.55 
Mental Age (Months)...... 153.00 2.69 133 30.17 
Educational Age (Months)..| 165.21 1.89 98 DAR22 
Attendance (Days) serene 1146.00 13.35 756 149.82 
Age 13 (77 Cases) 
Chronological Age (Months) | 162.26 12 3.49 
Mental Age (Months)...... 140.79 102 20.80 
Educational Age (Months)..| 155.58 86 17.05 
Attendance (Days) ........ 1258.90 1094 197.68 
Ages 8, 9, 10, 11, 12, 13 (829 Cases) 
Chranslorieal Age (Months) | 131.86 58 72 16.82 
Mental Age (Months)...... 132.61 87 146 25.15 
Educational Age (Months)..| 142.59 .79 126 22.67 
Attendance (Days) ........ 868.09 9.389 1370.5 270.39 
Ages 9, 10, 11, 12 (700 Cases) 
Chronological Age (Months) | 130.51 50 48 13,27 
Mental Age (Months)...... 132.52 97 146 25.56 
Educational Age (Months)..| 142.27 87 126 23.02 
Attendance (Days) ........ 852.60 8.85 | 1240.5 234.26 


12.76 
18.97 
15.90 
31.15 


10.17 
19.29 
16.18 
27.48 


EFFECT OF SCHOOL ATTENDANCE 75 


1. The Central Tendency of the Measures 


The slight excess of the mental over the chronological mean, re- 
vealed by Table I in every group, favors the initial assumption that 
the subjects of this study are, collectively, of average mentality. 

The striking superiority of the educational to the mental means 
may be thought to indicate that for these children acquisition is 
greater than seeming capacity; that is, to indicate that the school 
is not only utilizing all the mental capacity of its pupils, but that 
it is also spurring them on to levels of achievement nearly a year 
in advance of ‘reasonable’ expectation. It seems more plausible, 
however, to attribute the ascendancy of the educational means to 
an overestimate, by the test-builders, of the educational ages cor- 
responding to total scores on the Stanford Achievement Test. 

The low standard errors of the means reported in Table I in- 
dicate that, with the possible exception of the mental mean of the 
twelve-year-olds, much confidence may be placed in the obtained 
means. 


2. The Variability of the Measures 


The outstanding feature of Table I is the wide range of mental 
age within a single chronological age. The brightest twelve-year- 
old, for example, is 11 years, 1 month, older mentally than the dull- 
est. Less remarkable, but almost equally significant, is the range of 
educational achievement within any one age. The most advanced 
ten-year-old, for instance, is 8 years, 9 months, older educationally 
than the most backward. The attendance scores are, also, highly 
heterogeneous. On the basis of 200 days to a school year, one twelve- 
year-old has attended school 4 years, 5 months, 11 days, longer 
than another. 

The size of the standard deviations of mental age shown in 
Table I substantiates the evidence of the ranges as to the heterogen- 
eity of mentality. The standard deviation of mental age for the 
twelve-year-olds (30.17 months) is 5.17 months in excess of the 
standard deviation of mental age for unselected twelve-year-olds. 
The greater mental variability of these twelve-year-olds accounts 
for the larger standard error of their mental mean. The standard 
deviations of educational age, as well as the ranges, denote consid- 
erable educational heterogeneity. The large standard deviations 
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of the attendance distributions warrant the preliminary supposi- 
tion that the pupils of a single age have had quite different amounts 
of school instruction. 

The inclusion of the eight- and thirteen-year-olds in the total 
results leaves the variability of the mental and the educational 
distributions practically unchanged, but considerably increases the 
variability of attendance—results in conformity with logical de- 
ductions. The mental and educational scores of the brighter 
younger and the duller older children tend to fall within the range 
of scores of the unselected cases; whereas the attendance data of 
these younger and older pupils extend both limits of the scale. 


3. Comparison of the Groups 


As the standard deviation and the range are measures only of 
absolute variability within a distribution, the variability of the 
groups was rendered comparable by determining the coefficients of 
variation, V,? of each measured factor for each group. 

The variability of attendance, as Table I shows, is highest for 
the youngest pupils and decreases with each increase of age. Since 
nine years is probably the earliest age at which all pupils are en- 
rolled in school, length of attendance varies most at this age and 
less from year to year thereafter. 

Mental age, on the contrary, varies least for the nine-year-olds 
and most for the twelve-year-olds. Evidently, for these twelve- 
year-old children increments of a comparatively constant common 
nurture, that is, amounts of attendance upon instruction in the 
public elementary schools of New York City, have tended not to 
decrease, but to increase individual variation in intelligence. 

Variability in educational attainment, unlike attendance and 
mentality, shows no decided trend with age. It is, however, lower 
for the eleven- and twelve-year-olds than for the ten-year-olds. 
The decrease in the educational variation of the older pupils, which 
in the twelve-year-old group accompanies an increase in mental 
variation, and the smaller educational than mental variation at 
every age suggest that present instructional effort tends to level 
educational differences among pupils. If this inference is correct, 
it raises a host of questions—irrelevant to this problem, but highly 
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pertinent to education—as to the desirability of uniformity in edu- 
cational products. 

As the most variable group in attendance is the least variable 
in mental and educational ages, and the least variable group in 
attendance is next to the most variable in educational age and the 
most variable in mental age, it is apparent that differences in the 
amount of schooling of pupils of a single age do not account for 
differences in their mental and their educational development. 

The nine-year-olds are the most homogeneous of the groups in 
mental and educational ages, but the most heterogeneous in at- 
tendance. The ten-year-olds, the most variable educationally, are 
on the whole the most variable of the groups; the eleven-year-olds, 
the least variable. The twelve-year-olds, the most variable in men- 
tal age, are the least variable in attendance. 

The relative variability of the groups affects the next phase 
of the investigation, the degree of correspondence between the 
factors. 


IV. The Relationship of the Measures 


The relationship of the measures can be adequately considered 
only in the light of the number of cases, the degree of sclection 
involved, and the variability of the groups studied, As these 
spurious determinants of correlation have been treated in preceding 
sections, the study is concerned with the correspondence existing 
between one series of measures and another. It is interested pri- 
marily in the relation of length of school attendance to mental and 
to educational development, and in the relation of intelligence to 
educational achievement; and secondarily in the effect of chrono- 
logical age upon mental and educational ages and upon attendance. 

From the frequency distributions for each group scatter dia- 
grams were plotted and converted into correlation tables, from 
which the coefficients of correlation were determined by Toops’s 
adaptation of Pearson’s product-moment formula.* 

Linearity, required by Pearson’s product-moment formula, was 
assumed throughout. This assumption is partially justified by the 
prevalence of straight-line relationships in psychological studies 


*Toops, Herbert A. ‘‘Eliminating pitfalls in solving correlation.’’ Jour. 
of Expt. Psych., 4:434-447. 
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and by the tendency of the means of the successive arrays of the 
correlation tables of this study to form straight lines. 

The correlations reported are uncorrected for attenuation. Since 
errors due to chance lower correlation, the coefficients would be 
higher if all variable errors were eliminated. The formula, how- 
ever, for correction for attenuation requires not only approximate 
linearity, but also lack of correlation of the errors with the meas- 
ures and with each other. These conditions are so seldom realized 
in practice that corrected coefficients would probably have been 
too high and would have invalidated the resulting partial correla- 
tions. 

The use made of partial correlation in causal investigations has 
been frequently challenged. Miss Burks, in a penetrating criti- 
cism of the technique of partial correlation, inveighs against hold- 
ing constant, variables which may be wholly or partly caused by 
either of the correlated factors. She, however, concedes the valid- 
ity of using partial correlation to eliminate the influence of age 
on the true relationship between two correlated measures: 

‘<There are situations in which a variable is indisputably a cause rather 
than an effect. In one obvious type of situation, chronological age is such 
a variable. The many studies which have employed partial correlation tech- 
nique to eliminate the contribution of maturity to correlated measures are 
apparently on safe ground, provided they have published the precise age 
ranges of their subjects.’’ 


Since unquestionably age is not the result, but to some extent 
the cause of intelligence, educational achievement, and length of 
school attendance, the use of partial correlation in this investiga- 
tion to eliminate the influence of variation in age is believed to be 
entirely valid. The conclusions of the investigation and their prac- 
tical applications to education are based upon only the correlations 
in which age is constant. 

Partial correlations for two variables constant are also re- 
ported in the tables because they are needed in the construction 
of regression equations. Since no attempt is made to interpret 
these correlations as indicating degrees of causal relationship, the 
use made of them seems to be defensible. 


*Burks, Barbara 8. ‘‘On the inadequacy of the partial and multiple 
correlation technique.’’ Jour. of Educ. Psych., 17:538. 
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The total correlations, the probable errors of the coefficients, 
and the partial correlations of the four variables are given for the 
single age-groups and for the two ageregate groups. Total corre- 
lations are shown for the eight- and thirteen-year-olds, but are 
not discussed. 

Since heterogeneity increases correlation, in studying the cor- 
relation coefficients reported it should be borne in mind that co- 
efficients, regardless of the existing amount of direct relation, tend 
to be higher in the two composite age-groups and in the more varia- 
ble of the single age-groups. 


1. The Effect of Length of Attendance upon Mental Age 


The interaction of length of attendance and mental age is shown 
in Table II, which summarizes this relationship for all the groups. 
Although attendance and mental age have both varied greatly, 
the total correlations of these factors in the small groups are neg- 


TasBLE II.—TorTsL AND PARTIAL CoRRELATIONS BETWEEN SCHOOL 
ATTENDANCE AND MmEntTau AGE. 


Number Total PE Chronological vhronologient 
Age of Correlation sea ge Educational 
Cases Constant Ages Constant 
9 186 197 .05 213 .041 
10 222, 337 04 292 -.079 
11 166 .300 .05 .282 .084 
12 126 .138 .06 118 .094 i 
Restricted Total 700 .503 .02 .213 -.030 
8 52 .046 .09 
13 77 107 .08 
Entire Group 829 AGB1 .02 .199 bee -.047 


ligible. In the youngest and oldest groups the correlation coeffi- 
cients are, respectively, .20 + .05 and .14+ .06. In the eleven- 
year-old group, correlation rises to .30~ .05; in the ten-year-old 
group it reaches .34 + .04, a reliable but low coefficient. Hetero- 
geneity of age produces total correlations of .50 + .02 for the re- 
stricted and .46 + .02 for the unrestricted total—substantial corre- 
lations with low probable errors. 

When the influence of differences in age is excluded, the cor- 
relations are only slightly reduced for the small groups, but are 
diminished for the composite groups to inconsiderable, although 
reliable, correlations of .21 + .02 and .20 + .02. 


80 THE TWENTY-SEVENTH YEARBOOK 


Evidently, for the subjects of this study variation in mental 
development is practically independent of variation in length of 
school attendance. 


2. The Effect of Length of Attendance upon 
Educational Age 


The degree of dependence of educational age upon length of 
attendance in the various groups is indicated by Table III. 


Tasie III.—Torau anp PartraL CorrELATIONS BETWEEN SCHOOL 
ATTENDANCE AND EDUCATIONAL AGE. 


Chronological 


Number Total Chronological and 


A f P.E.r g ; 
= Cases Comes Constant eee ee 
9 186 193 .05 149 076 
10 222 435 .04 403 300 
11 166 335 .05 313 164 
12 126 215 06 203 191 
Restricted Total 700 .617 .02 .295 PAL 
8 52 .096° .09 
13 77 .296 .O7 
Entire Group 829 .608 O01 291 222 


In the homogeneous age-groups correlation is present, but not 
impressive. It varies from .19 + .05 and .22 + .06, respectively, 
in the youngest and oldest groups to .83 + .05 and .44 + .04, re- 
spectively, in the eleven- and ten-year-old groups. The higher cor- 
relation of the ten-year-olds is probably due to greater variability 
and some selection. A substantial correlation, moreover, of .62 + 
.02 for the 700 pupils is confirmed by one of .61 + .01 for the 829 
cases. 

Freeing the total correlations from the effect of age variation 
only slightly decreases correlation for the groups of one age, but 
halves it for the combined groups; the correlation for the 700 
comparatively unselected subjects is .30 + .02—a reliable, but by 
no means imposing, correlation. 

Apparently, differences in length of school attendance contrib- 
ute but an inconsiderable portion to differences in educational 
acquirement. 
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3. The Effect of Mental Age upon Educational Age 


The agreement is marked between mental and educational ages 
in Table IV, which epitomizes this relationship for all the groups. 
Pupils with a high mental score tend to have a high educational 
score, 


Tasie TV.—Toran anp PartraL CorreLations Betwren MENTAL AND 
EDUCATIONAL AGEs. 


Chronological 


Number Total Chronological Ao d 
A 3 ge an 
ge csee Correlation P. Er eee t yiiende es 

9 186 723 .02 AWE 711 

10 222 837 01 826 809 

alg 166 731 .02 727 701 

12 126 834 .02 833 832 

Restricted Total 700 .841 SO 781 710 
8 52 .740 .04 
13 77 Loyal .05 

Entire Group 829 .825 O01 .780 .770 


The total correlations for the groups of one age are remarkably 
high. That they are reliable is evidenced by the low probable er- 
rors of the coefficients. The size of the coefficients varies from 
.72 + .02 and .73 + .02 for the nine- and eleven-year-olds to 
.83 + .02 and .84 + .01 for the twelve- and ten-year-olds. The 
fact that these latter correlations for groups of homogeneous age 
are as high as the coefficients, .84 + .01 and 82+ .01, for the 
composite groups demonstrates that the impressive correlations are 
little affected by differences in age. 

The persistence of high correlations when the influence of vari- 
ability in age is controlled offers confirmatory evidence of this fact. 
For the groups of single age, eliminating age-variation leaves cor- 
relations ranging from .72 + .02 to .83 + .02. For each aggregate 
group, rendering age uniform leaves a surplus score of correlation 
of .78 + .01, a very significant correlation. 

The high correlation coefficients of mental and educational ages, 
uninduced by age differences, indicate that variation in mental 
endowment very materially affects variation in educational achieve- 
ment. They signify, also, that the Stanford Binet is a good meas- 
ure of the phases of intelligence necessary for scholastic success. 
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4. The Effect of Chronological Age upon Mental and 
Educational Ages and Length of Attendance 


The portion that chronological age contributes to mental and 
educational ages and to length of attendance for each group is 
shown in Table V by the total correlations of chronological age 
paired in succession with each of the other variables. 


Tasie V.—Totat CorrELATIONS BETWEEN CHRONOLOGICAL AGE AND MENTAL 
AGE, EpucaTionaL AGE, AND ScHooL ATTENDANCE. 


Number Mental Age Educational Age School Attendance 
Age of Total Total Total 

Cases Comsahion ol oe Cpeciation Ne ee Correlation tar 

9 186 197 05 51 05 317 04 

10 222 282 04 241 04 228 04 

11 166 117 05 141 05 235 05 

12 126 160 06 .109 06 147 06 

Restricted Total 700 498 .02 .603 .02 777 .O1 

8 52 —.119 .09 .076 .09 —.152 .09 

13 EC .015 .08 .009 .08 <OOU .07 

Entire Group 829 435 .02 012 .62 .830 .O1 


The degree of correspondence of chronological with mental age 
revealed in Table V is positive, but slight, in the single age-groups 
and is higher for the younger than for the older groups—ranging 
from .12 + .05 for the eleven-year-olds to .28 + .04 for the ten- 
year-olds. The relationship between chronological age and mental 
age is probably most naturally represented by the total correla- 
tions, which are .50 + .02 for the total of 700 and .48 + .02 for 
the entire group in which selection plays some part. This marked 
and reliable correlation of .50 for the relatively unselected subjects 
is in accordance with the facts known about the effect of age upon 
mental development. 

The correspondence of chronological age with educational age 
shown in Table V is low, with high probable errors for the homo- 
geneous age-groups; it ranges from .11+ .06 to 24+ .04. For 
the heterogeneous age-groups it is .60 + .02 and .57 + .02—hboth 
very considerable and reliable correlations. 

The total correlations between chronological age and attendance 
given in Table V decrease from .38 + .04 for the youngest pupils 
to .15 + .06 for the oldest. In the heterogeneous groups corréla- 
tion rises to .78 + .01 for the restricted, and to .83 + .01 for the 


~*~ 
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unrestricted total. These latter correlations for pupils of various 
ages conform to the well-known fact that during the period of 
compulsory attendance the length of time a child has attended 
school depends largely upon his age. 

In sum, then, comparison of the coefficients of educational age 
correlated with mental age, with attendance, and with age indicates 
that intelligence is much more powerful than either length of school 
attendance or age in determining educational acquirement. The 
contrast in the size of these coefficients is particularly marked in 
the small groups, where variability of age, which influences cor- 
relation, is largely controlled. 


5. A Test of Selection in the Groups 


Since the degree of selection of the subjects affects the size of 
correlation coefficients, it is important to apply an objective test of 
the selection involved. Now that the coefficients of variation and 
the coefficients of correlation have been presented, it is possible to 
make the empirical test of selection suggested by Godfrey Thomson 
and Rudolf Pintner.6 In an unselected group the correlation co- 
efficient of chronological and mental ages, they point out, should 
be approximately equal to the ratio of the coefficients of variation 
of chronological and mental ages. Table VI compares the relevant 
coefficients with the ratios for the four age-groups and for their 


total. 
TABLE VI. 


COMPARISON OF THE CORRELATION COEFFICIENTS OF CHRONOLOGICAL AND MENTAL 
AGES WITH THE RATIO OF THEIR COEFFICIENTS OF VARIATION 


Coefficients of Ratio of Coefficients 
Age Total Correlation of Variation 
9 .20 23 
10 .28 16 
11 12 17 
12 16 12 
Restricted Totai 
(700 cases) .50 53 


The correspondence of the coefficients of correlation with the 
ratio of the coefficients of variation, for chronological and mental 


*<<Spurious correlation and relationship between tests.’’ Jour. of Educ. 
Psych., 15:442. 
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ages, is so close for the composite group and for the single age- 
groups—except the ten-year-old group where the difference is .12— 
that, judged by this relationship, the 700 subjects are, as originally 
assumed, practically unselected. 

The fact that, according to this criterion, the ten-year-olds are 
the most, and the nine-year-olds the least selected, accounts in part 
for the respective higher and lower correlation coefficients of the 
two groups. In the former group more selection, reinforced by 
higher variability, increases the obtained correlation; in the latter 
group less selection, supported by lower variability, depresses the 
obtained correlation. 

Judged by this test, selection has played little part in deter- 
mining the degree of relationship between the variables studied. 


V. THE VALUE OF THE FACTORS IN THE PREDICTION 
OF EDUCATIONAL AGE 


1. The Regression Equations 


Partial correlation is valuable not only in freeing correlation 
of extraneous influences, which, by affecting the homogeneity of 
the subjects, distort the real relationship, but also in building re- 
gression equations upon which prediction may be based. 

The regression equation used in this investigation admits of 
prophecy as to a pupil’s educational achievement from knowledge 
of his intelligence rating, the length of time he has attended school, 
and his age. It combines these data so as to yield the most accu- 
rate educational score possible from a combination of the other data. 

The regression equations’ for the prediction of standard educa- 
tional scores for the ten-year-old group and for the restricted 
total of 700 pupils, are, respectively, 

% = .783 z+ .176 z,—.021 %, and 

%, = 692 z+ .171 2, -+ .126 % 


™Derived from the equation, 


a = B 12.34 Ze +- B 13.% Z3 + B 14.2 Z4 


o 
( Bus = 2 bun) 


z, designates a pupil’s standard educational score - 


and Z, Z, and z, respectively, designate his intelligence, attendance, and 
chronological scores, all expressed as standard scores. 


' 
gross score — mean ) 
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The coefficients of z,, z3, and z, (the regression weights) indicate 
the weights to be assigned to the intelligence, the attendance, and 
the age scores, respectively, in estimating a score in educational 
achievement from these three scores combined. 

The second equation gives the best possible forecast, based upon 
the 700 comparatively unselected subjects, of a pupil’s educational 
age when his mental age, the length of his school attendance, and 
his age are known. 

A comparison of the regression weights of this equation, .692 
for mental age, .171 for attendance, and .126 for age, shows that in 
the prediction of educational age the relative weights of the Binet 
score, of length of attendance, and of chronological age are 5.49, 
1.36, and 1.00, respectively. These values, however, must not be 
interpreted to mean that, independent of the other variables, mental 
age is four times as important as attendance and five and one-half 
times as important as age in determining educational achievement. 
The regression weights apply only to the relative weights of the 
other three scores in predicting an educational score; they do not 
apply to the relative independent contributions of intelligence, 
attendance, and age to school success. 


2. The Reliability of the Prediction 


The statistical reliability of a regression equation is determined 
by the probable error of estimate and by the multiple coefficient 
of correlation. 

The probable errors of estimate of the regression equations of 
this study® are 6.8 months and 7.6 months for the ten-year group 
and for the 700 unselected subjects, respectively. The chances are 
even that the error made in using a predicted instead of an actual 
educational score would be less than 6.8 months for a ten-year-old 
and less than 7.6 months for one of the group of 700. 

The coefficients of multiple correlation® for the ten-year-olds 
and for the restricted total are, respectively, .85 and .87. This 
means that if the educational ages of the 700 unselected pupils of 
this study are estimated from the regression equation, the correla- 


*Computed by the formula, 
PE (est. x,) = .6745 X 1, 284 
*Computed by the formula, 


o 
Ry (234) = i- _—* 


1 o*, 
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tion between the estimated scores and the scores which these pupils 
actually obtained when tested with the Stanford Achievement Test 
will be .87. 

The probable errors of estimate and the multiple coefficients 
of correlation of the foregoing regression equations indicate that 
the latter are probably more accurate instruments for estimating 
the educational scores of the pupils studied than the combined 
judgment of the best teachers, but that they fall short of the accu- 
racy necessary for extensive use in educational prognosis. 


3. The Predictive Indexes 


The value of the coefficients of correlation in prediction may be 
estimated from the coefficients themselves without the intervention 
of a regression equation. The predictive index (1—k), which 
gives the positive aspect of the coefficient of alienation,!® repre- 
sented by k (k= \/ 1—r?), may be utilized for this purpose. It 
shows the degree of correspondence between an obtained score in 
one of two correlated traits and a score in the same trait estimated 
from knowledge of a score in the other trait. 

For the group of 700 pupils the predictive indexes based upon 
the total correlations of intelligence, of attendance, and of age, 
with educational age are, respectively, .46, .22, and .20. The pre- 
dictive indexes of intelligence and of attendance with educational 
age, for uniform chronological age, are, respectively, .37 and .05; 
and the predictive index of chronological age with educational age, 
for uniform attendance, is .03. 

It is evident that individual educational scores cannot be esti- 
mated with accuracy from intelligence, attendance, or age data. 
It is equally obvious that intelligence is a better index of achieve- 
ment than attendance and age combined. 


VI. THE CoNcLUSIONS AND THEIR IMPLICATIONS 
FOR HDUCATION 
1. Summary of the Conclusions 
The interpretations which accompany the data presented in the 
preceding pages are summarized at this point to serve as a founda- 
tion for suggestions concerning the educational significance of.the 
results. 


* Kelley, T. L. Statistical Method, p. 173. 
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The conclusions can be generalized only in the light of the pre- 
liminary definitions and the admitted limitations of the study. 
Within these limits the data reported seem to warrant the follow- 
ing deductions for at least the 700 comparatively unselected pupils 
studied : 

1. Among the subjects of the same age there are great individ- 
ual differences in mental and in educational ages and in length of 
school attendance. These differences very probably represent ex- 
treme variation in both the nature and the nurture of pupils of 
uniform age. 

2. Of the homogeneous groups studied, the most variable group 
in length of school attendance is the least variable in mental and 
in educational ages; the least variable group in attendance is the 
most variable group mentally and next to the most variable edu- 
eationally. Apparently, differences in the amount of schooling of 
these pupils of single age do not account for differences in their 
mental and their educational development. 

3. Although the quantity of school instruction has varied much 
and the quality little, the negligible correlations between length of 
school attendance and mental age indicate that the striking in- 
dividual differences in the mental capacity of these unselected 
children of constant age are not appreciably due to variation in 
the amount of school instruction they have received. 

4, The low correlations between length of school attendance 
and educational age signify that the great individual differences 
in the educational acquirements of these unselected pupils of the 
same age are not to a significant extent the result of variation in 
the time they have spent in school 

5. The persistence of high correlations betweens the mental 
and the educational ages of the children studied, for constant con- 
ditions of chronological age, denotes a close and doubtless a causal 
relationship between their mental capacity and their educational 
development. The radical differences in the educational achieve- 
ment of these unselected subjects are probably due to still greater 
differences in their mental endowment. 

6. Variability in age has contributed much less than variability 
in mentality to the educational status of the pupils tested. The 
lower correlations of chronological age with mental and with edu- 
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cational ages for the homogeneous groups show that some of the 
correspondence for the composite age-groups is due to heterogeneity. 

7. The marked relationship between chronological age and 
length of school attendance indicates that the number of days the 
subjects have attended school has been largely conditioned by age. 

8. Comparison of the results of mental age, of length of school 
attendance, and of chronological age, correlated with educational 
age, particularly for the groups of uniform chronological age, shows 
that intelligence has contributed much more heavily than either 
attendance or age to the educational achievement of the children 
studied. 


2. The Educational Implications of the Conclusions 


If the data for the 700 practically unselected subjects are re- 
liable, and if the conclusions based upon the data are sound, there 
is a reasonable presumption that both are more universally ap- 
plicable. 

If the correlations obtained are reliable indexes of existing 
relationships between variation in time spent in school and both 
mental and educational variation, it is obvious that considerable 
instructional effort is now wasted and should be redirected along 
more fruitful lines. 

Since differences in mental capacity among children of a single 
age are practically uninfluenced by varying amounts of school in- 
struction, but are most influential in determining differences in 
educational acquirement, it is evident that the result of educa- 
tional effort is largely dependent upon the mental capacity of the 
pupils. Consequently, the school should take mental capacity into 
account in instructional planning. Recognition of the uncontrolla- 
ble, but controlling power of mental capacity in determining edu- 
cational development will result in readjustment of the whole 
educational program to individual capacity and needs. 

To this end the school should ascertain, by means of intelligence 
testing, the present mental status of each pupil and upon that 
should estimate within fairly broad limits his most probable edu- 
cational possibilities, limitations, and needs. Such diagnosis and 
prediction are essential for the proper classification of the pupil — 
and for the determination of the best educational treatment to 
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meet his present and his future needs. It is futile to give instruc- 
tion without due consideration of a pupil’s ability to profit by it 
and of his probable use for it. 

Fuller realization of the existence and consequence of wide 
mental variation for constant age will result in classification largely 
on the basis of intelligence and in the differentiation of courses of 
study and methods of instruction. Such provision for individual 
mental variation can be made without neglect of the important 
integrative function of elementary education in a democracy. The 
necessary integration can be secured through a common core of 
material for each of the differentiated courses in a subject of study 
and through classification in the ‘‘special’’ subjects and participa- 
tion in extra-classroom activities on bases other than intelligence. 

Not only should the school take account of the innate differ- 
ences which condition educational results, but the state, which pro- 
vides public education, should also take cognizance of them. 
Whether public education is regarded as an investment by the state 
to promote its own interests or to develop the highest type of 
individualized personality, since in either case the product is largely 
dependent upon mental endowment, public education cannot be 
better served than by the state’s promoting the science of eugenics 
and disseminating the facts learned in order to improve the mental 
equipment of school children. 

While the results presented indicate that there is little corre- 
spondence between variation in school attendance and variation 
in educational achievement, it should be noted that the correla- 
tions reported do not represent the absolute contribution of time 
spent in school to educational acquisition. In as much as nearly 
all the pupils studied were classified from the fourth to the seventh 
grades, inclusive, it is highly probable that some skills—for exam- 
ple, the mechanics of reading—and some facts—the multiplication 
tables, for instance—having been mastered by practically all the 
pupils tested, did not considerably affect variation in the scores, 
although they did contribute greatly to the individual scores. Had 
some children been tested who had attended school for only a 
short time or not at all, the lack of the educational elements com- 
mon to the pupils who had attended for some time would very 
materially have lowered their scores, thereby increasing variability 
and raising the correlation. 
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In considering the data reported, the limitations of the problem 
must also be kept in mind. Not all the ends of education have 
been measured, only information and skills, the narrower and more 
tangible objectives. Accurate means of measuring the acquisition 
of ideals, attitudes, and appreciations, the wider and more intangi- 
ble values of education, are not yet available. Hence this study 
has made no attempt to measure these broader phases of behavior 
and ean, therefore, shed no light on the extent to which the school 
is achieving these higher aims. 

Notwithstanding the foregoing extenuations, this investigation 
indicates that the school has made only a beginning in utilizing the 
mental capacity at its disposal. However largely educational ac- 
quirement may be prescribed by mental endowment, it is probable 
that instructional capacity and effort will always be taxed to the 
utmost to develop all the educational possibilities of all the pupils. 
Far from discouraging educational activity, therefore, the resulis 
of this investigation present a stimulating challenge to renewed 
and better directed effort. 
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PREFATORY NOTE TO CHAPTER IV 


This chapter may be considered by the reader along with the 
two that precede it. The evidence presented by Obrien indicates 
that the cumulative effect upon pupils’ school abilities of attend- 
ing school only seven, rather than eight, months a year amounts 
to very little, even in the course of eight years, although pupils 
attending eight-months’ schools have had, by the time they reach 
the eighth grade, the equivalent of one school year more of school- 
ing than those attending the seven-months’ schools. 

Obrien also reports practically no correlation for several hun- 
dred pupils between their percentage of attendance over a period 
of two and a half years and their scores on achievement tests; but 
this result cannot be clearly interpreted, owing to the fact, pointed 
out by the author, that a much larger percent of the group having 
lowest attendance are chronologically retarded. Thus, the pupils 
with poorest attendance records for the two and a half years, being 
older on an average than the classmates with whom they are com- 
pared, may actually have had more months or years of schooling 
than those with better attendance records. 

The rather striking conclusions of this study should not be inter- 
preted as meaning that it does not matter how long children go to 
school. To what extent the children in seven-months’ schools or 
the children with irregular attendance may suffer in the acquire- 
ment of ideals, ambitions, and personality or character traits not 
measured by tests, but cultivated in schools, this study ean, of 
course, give no inkling. If this aspect of school life is not kept in 
mind, the reader may find himself wondering what, after all, the 
real purpose of our schools should be. 
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CHAPTER IV 


THE CONDITIONAL VALUE OF A LONGER SCHOOL YEAR 
IN ONE-TEACHER SCHOOLS 


FB, P. OBRIEN 
Director, School Service and Research, 
University of Kansas, Lawrence, Kansas 


NATURE OF THE INVESTIGATION 


From data gathered in connection with a recent study of pupil 
achievement in one-teacher schools in Kansas a comparison was 
made of the achievement of 861 pupils attending one-teacher schools 
for eight months of each year with the achievement of 334 pupils 
who were attending one-teacher schools in the same region for 
seven months of each year. An analysis was also made of the 
effect on school achievement of different degrees of regularity in 
school attendance in these one-teacher schools. A total of 98 
schools in 14 counties located in different parts of the state were 
included. 


THe TESTS 


The tests employed in this study were: the Courtis Arithmetic 
Test in Fundamentals, Series B; Burgess Silent Reading Test, 
Form 1; Buckingham-Ayres Spelling Seale, List W, 20 words; 
Willing Composition Seale; Ayres Handwriting Scale; and the 
National Intelligence Test, Scale A, Form 1. All tests were given 
in Grades III to VIII, except spelling, which was given only in 
Grades VI, VII, and VIII. Uniform testing procedure was fol- 
lowed, and the work was done under close supervision. 


COMPARISON OF SEVEN-MontTH AND Hicgut-MontH SCHOOLS 


The pupils in the eight-month schools have had, by the end of 
the seventh grade, seven months more of schooling than the pupils 
who attended the shorter session. In Grades VII and VIII the 
pupils attending schools having the longer session showed a marked 
tendency toward higher achievement scores. Yet in no subject or 
grade was the difference equivalent to a half-year of progress. 
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Below the seventh grade the difference in median scores, as shown 
in Table I, favored the eight-month schools in only nine of the 
twenty subject and grade comparisons. There was no difference 
in two of the comparisons. 


TaBLE I.—ComparRIsSON OF THE Purius’ Mepran ScorES BY GRADES IN SEVEN- 
Monta AND E1igut-Monts ScHoous OF THE ONE-TEACHER TYPE 


Spelling Reading Composition 


Grade Seven- | Eight- | Seven- | Hight- | Seven- | Hight- 
Month | Month | Month | Month | Month | Month 
Schools | Schools | Schools | Schools | Schools | Schools 


DUT a Perseetarc ee votes enayste nea 2.3 1.8 16.4 16.3 
LIVER Srostesvoherehs lacoste at aah 6 Bad 22.6 20.6 
VER Rad ei he, Haat Bia: Sid 4.7 5.2 25.6 27.2 
AVL Were sent states Berl 33,011 5.2 6.6 29.1 29.0 
VE eeaetastetete were ee es 4.3 6.5 6.4 7.8 31.9 36.9 
ALVES Fae act AA eatin 10.0 9.5 8.5 8.6 34.0 37.8 
Number of pupils 190 439 295 759 334 861 
ETO Gan ee dean tie ies 629 1,054 1,195 
Grade Handwriting Addition Multiplication 
TAT Rete et tons Bc heehee 23.1 23.7 0.7 0.7 a 
LV sivas Ae ape Seay eee ese 23.2 24.8 iil 1.5 0.6 0.8 
Vitat abies carte aeet oe 26.3 29.0 1.6 LES 2.1 6 
Vil Drees sh comets 31.8 31.1 2.5 2.4 2.8 2.8 
AVA Wea aes Cac ae: 85.4 37.1 2.8 3.0 3.8 BAY 
AON ter enioe ats caioar 38.6 39.5 3.8 3.0 4.9 5.0 
Number of pupils 334 861 334 861 276 127 
Dotall eerie ste os 1,195 1,195 1,003 


RELATION OF ATTENDANCE TO ACHIEVEMENT 


The general bearing of the facts presented relative to the length 
of the school year is strikingly reinforced by an analysis of the 
relation of attendance to achievement of pupils in these one-teacher 
schools. The school attendance records were secured over a period 
of two and one-half years for 585 pupils in 63 one-teacher schools. 

In this analysis the pupils were classified into three attendance 
groups: those who had attended 90 percent or more of the time, 
those who had been present from 80 to 89 percent of the time, and 
those attending less than 80 percent of the time. These three at- 
tendance groups included 333, 149, and 103 pupils, respectively. 
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It is significant also that pupils in the highest attendance group 
generally had also the longer school year. 

The test results showed that the median achievement of each of 
the two lower attendance groups frequently equalled or surpassed 
the median accomplishment of pupils in the highest attendance 
group. In no grade was the highest attendance group superior 
in all the tests. Evidently, regular school attendance alone is not 
so important an influencing factor in school achievement in rural 
elementary schools as has ordinarily been assumed. Its value is 
at least conditioned by other factors. 

The correlation between the percentage of attendance of these 
pupils and their mental test scores was close to zero (r= 0.05). 
As might be expected, however, a much larger percent of the 
lowest attendance group was retarded chronologically and a smaller 
percent accelerated than was true in either of the other two groups. 


PREFATORY NOTE TO CHAPTER V 


Chapters V, VI, and VII all deal with the amount of effect pro- 
duced upon a pupil’s educational achievement by the environmental 
factors teaching ability or school methods. In the first of these 
chapters is to be found a type of study that is much needed, but 
very difficult to make. The problem is concerned with the relative 
contribution of a pupil’s intelligence and of the kind of teaching 
he receives to the scholastic progress he is able to make, in this case 
to progress in arithmetic fundamentals and reading comprehension. 

The problem is a difficult one to handle because we are not sure 
just what teaching ability really is, nor just how to estimate it; 
and because intelligence is a permanent attribute showing a cumula- 
tive effect throughout a child’s school career, whereas teaching in- 
fluence changes as the child moves from grade to grade. 

The first difficulty—that of estimating teaching ability itself— 
may possibly obstruct a complete study of the problem at hand 
until a valid method of evaluating teachers is devised by experi- 
mental means. Mr. Taylor limits his investigation, however, to a 
study of the effect of teaching ability as rated by school admin- 
istrators. Though admitting that such ratings may include ‘‘indica- 
tions of the general reputation a teacher bears for codperativeness, 
educational up-to-dateness, and disciplinary success—all of which 
may or may not be closely related to the measured achievement of 
the pupils,’’ he justifies his study by pointing out that at least the 
ratings secured are those commonly used to designate teaching 
ability, and that it is important and pertinent to investigate the 
influence of teaching ability thus rated. 

The difficulty due to the prolonged and cumulative influence of 
intelligence and of the short duration of the influence of the par- 
ticular teacher in question is partly met by computing a regression 
equation for predicting achievement in arithmetic and in reading 
from intelligence, overageness, teaching ability, and initial achieve- 
ment of the pupils in arithmetic and reading at the beginning of 
the experimental semester. Since initial achievement is itself a 
product of the previous influences upon learning of pupils, the 
regression coefficients of intelligence, overageness, and teaching 
ability must in consequence represent the effects of these variables 
only during the semester measured. As is evident from Tables II 
and III, estimated teaching ability has very little effect upon the 
achievement of pupils over a span of one semester. 

Although there may be some question regarding the logical 
basis upon which Mr. Taylor derived the reliability coefficients 
which he uses in the corrections for attenuation, it should be noted 
that the conclusions of the study would not be greatly altered if 
the corrected coefficients were omitted altogether. 
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CHAPTER V 


THE INFLUENCE OF THE TEACHER ON RELATIVE 
CLASS STANDING IN ARITHMETIC FUNDAMENTALS 
AND READING COMPREHENSION 


Howard TAYLOR 
University of Oregon, Eugene, Oregon 


There are at least two ways in which the influence of the teacher 
may conceivably be an important factor in determining the achieve- 
ment of students: (1) personality may have an inspiring motivat- 
ing effect; (2) instruction may help the learner use better methods. 
In this study it has been impossible to separate these two influences. 
Moreover, it may be that the ratings used to measure the capacity 
of each teacher to influence the achievement of her pupils are 
not so much ratings of teaching ability as they are indications of 
the general reputation a teacher bears for codperativeness, edu- 
cational up-to-dateness, and disciplinary success—all of which may 
or may not be closely related to the measured achievement of the 
pupils. Nevertheless, such ratings of teachers are commonly used 
as indications of a hypothetical teaching power, and it seemed 
pertinent to use them in our effort to analyze the factors under- 
lying pupil achievement in two subjects, arithmetic fundamentals 
and reading comprehension. 

For our study we used the performance of pupils during the 
first semester of the school year 1923-24 and compared it with the 
ratings of the teachers who were responsible for the instruction 
during the same period. The records available pertained to 105 
slasses in the ten different half grades (4B to 8A inclusive) of 
fine elementary schoolsof a medium-sized city. They comprised 
data on the following items: 

1. Final achievement in Arithmetic Fundamentals (Woody-McCall, 
Form I) and Reading Comprehension (Thorndike-McCall, 
Form IT) 


2. Initial standing in Arithmetic Fundamentals (Woody-McCall, 
Form I) and Reading Comprehension (Thorndike-McCall, 
Form I) 
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3. Standing in “Overageness” (Chronological Age) 
4. Standing in Intelligence (National Intelligence Tests) 
5. Teaching Ability to which each was exposed in arithmetic and reading 


In some eases, on account of departmental instruction, certain 
classes had one teacher for arithmetic and another for reading. 
The rating appropriate to the teaching ability of the actual in- 
structor has been assigned to each class in each subject. 

We need, before presenting the results, to explain the general 
logic of our investigations. At the beginning of the school year each 
teacher is faced with a roomful of children. During the semester 
she is undergoing a practical test of teaching ability. Each child in 
the room constitutes an ‘‘element’’ in the test. But the elements 
differ from one another. The initial capacity of each pupil, his 
intelligence, his age in comparison with his grade position, are all 
presumably related in some degree to his final achievement in skill 
and knowledge which his teacher for that year or term is striving 
to make as large as possible. The statistical treatment used in an 
analysis of this situation is based on the following reasoning: 

1. If any of the factors mentioned are important in determin- 
ing the final standing of a class, they should tend to be operative in 
similar fashion throughout the grade range 4B to 8A. The in- 
struction is similar and the effort and ability required of pupils 
is of essentially the same sort in each grade. All our measure- 
ments have been made, in fact, by the same standard tests through- 
out this grade range. 

2. The extent to which these factors, when properly weighted, 
will predict the final standing of a class with reference to the other 
classes of the same grade is important evidence of the significance 
of each factor independent of the others. 

3. The average performance of all the pupils in a class whose 
records in each variable factor are available may be taken as repre- 
sentative of the standing of each class in each variable. 


Obviously, it is not important to know which classes are farthest 
above or below the average of the whole group of 105 different classes 
in the 10 different half grades. What is desired is to ascertain the tenden- 
cies toward concomitant variation of these factors within each half grade 
about the mean of all such half-grade classes in the city. Thus, it is the 
deviation of each 4B class from the mean of all the other 4B classes in 
each variable which concerns us, In the same way, we wish to know 
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whether or not there is any consistent tendency for the other classes 
of each half grade to fluctuate about the average of each group of classes 
in the same half grade for each of the factors upon which data have 
been obtained. Therefore, the relative standing of each class in each 
variable has been measured from the mean of the 9 or 15 (or what- 
ever the exact number may be) similar classes in the same half grade. 
This assumes that if a 6A class is a given distance above the average 
of the other 6A classes in final ability in arithmetic fundamentals, it is 
an indication of exactly the same superiority in this kind of achievement 
as in the case of a 4B or 8A class which is an equal distance above 
the mean of the other classes in its own half grade. 

This assumption that the means of several series of measures of 
the same mental function are equivalent reference points from which 
to make measurements is a common statistical procedure and would 
seem to be entirely justified in this case.’ 

It is necessary to make these measurements of the distance each 
class is above or below the mean of the other classes of the same half 
grade, in essentially equivalent units. We have no way of knowing that 
the 5A class which is 6 test-score units above the mean of the other 5A 
classes in the city is as superior as the 8B class which is, say, 3 test- 
score units above the average of its half grade. Three test-score units 
above the mean of 8B classes may be indicative of greater, not less 
superiority than 6 test-score units above the mean of 5A classes. In 
such cases it is customary to consider some measure of the variability 
of the group as an equivalent unit of distance from the mean. Through- 
out this study the standard deviation of the group concerned has been 
used as the measure of distance above and below the respective means, 
which have been regarded as the zero points of each distribution of class 
averages in each half grade.’ 


Thus, starting with the records of each child in six different 
variables (initial and final achievement in arithmetic, initial and 
final achievement in reading, intelligence, and chronological age), 
the standing of each class has been determined from the records 
of those children in the class for whom there were complete records. 
The total number of children whose records constitute the original 
data of this study is 1968. Each class is represented by an average 
of the records of the children who composed it. In general, there 
were about 19 children per class for whom all the data were avail- 
able. In some cases classes were considerably smaller than this; in 
others considerably larger. Each class average was then expressed 
as a deviation from the mean of a distribution of similar class 


1 Kelley, T. L. Statistical Method, Section 34, p. 114. 


100 THE TWENTY-SEVENTH YEARBOOK 


averages, including all the other classes of the same half grade 
throughout the city school system. Each deviation was divided by 
the standard deviation of the distribution of class averages to which 
it belonged. Thus, essentially comparable measures of the various 
factors were secured for 105 elementary-school classes. 

In the same way the differences in teaching ability had to be 
expressed for each class teacher. The data were gained from two 
sets of rankings by the principals of each elementary school and two 
sets of ratings made by the head of the city research department, 
who was acquainted with the classroom work of all teachers. The 
principals’ first ranking correlated with their second, made over a 
year later, to the extent of 649+ .05. The two ratings made 
by the head of the research department correlated .883 + .02. 
The research department ratings (sum of two) correlated with the 
principals’ rankings (sum of two) .625+.04. This amount 
of correspondence indicates that to a considerable extent the same 
factors were operative in each set of estimates, but that the judg- 
ments were nevertheless made independently. 

Proceeding according to the reasoning outlined above, the rat- 
ing of each teacher of each class was compared with the ratings of 
all the other teachers of similar classes in the same subject and 
within the same half-grade. The average estimated teaching ability 
assigned to that grade was considered as a reference point, or base, 
from which to measure the relative degree of teaching ability which 
each class enjoyed. The standard deviation of the estimated teach- 
ing ability assigned to the different classes of each half grade was 
the unit in which these differences in teaching ability were 
measured. By this method comparable measures of the teaching 
ability effective in each class in each subject were computed. 
These measures were computed separately for the sum of the two 
ratings made by the head of the research department and for the 
sum of the two rankings made by the principals. 

It is important to know the reliability of these sets of measures 
used to compare each class and each teacher with the other classes 
and teachers of the same half grade in the city. The reliability was 
figured in this way. The records of each child in a class constitute 
a sample of the initial ability in each subject, the degree of ‘‘over- 
ageness,’’ the degree of intelligence, and the final achievement of 
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the class working under a given degree of teaching ability. If the 
standing of each class be determined from a random half of the 
pupils in it and this standing be compared with the standing of the 
class determined from the other half of the pupils in it, the com- 
parison will furnish evidence of the consistency with which these 
two samples measure the differences studied. Accordingly, the 
records of each alternate child in each class were grouped together 
(with approximately equal proportion of the sexes in the groups) 
and the two groups obtained were treated separately, as has been 
described. The self-correlation of these measures of class standing 
in each variable and the prophesied reliability? of their sum is 
shown in Table I. The probable errors of these estimated coeffi- 
cients have been computed from Shen’s formula.? N in all cases 
equals the number of classes compared, 1.e., 105. 


TaBLE I.—SELF-CORRELATIONS OF VARIOUS MEASURES 


Self- Estimated 


Correlation | Reliability | P- E- 
Binal tArithmeticneeracavascercecces tae . 7470 .855 + .02 
(Rin ail ARG si cinn g eagewee eee a) asap: kara oles .6068 155 + .03 
lanitia WATIGh Ove tl Grsaseratstrs cece tues cise .6347 3 Ie + .03 
Initial cadin gee ee Saris ac reelection shee .6110 .759 + .03 
Chronological age (Overageness)......... .6345 .776 + .03 
lartelhigencetpan tre aa tre eeiie re tctevae .6039 A (oes + .03 
Teaching Ability of Arithmetic Teachers 0123 .678 + .04 
Teaching Ability of Reading Teachers.... . 53909 .694 + .04 


The estimated teaching ability of each teacher as compared with 
that of the other teachers of the same subject in the same half grade 
was determined separately from the principal’s ranking and the 
research department’s rating. It is the correlation of these sepa- 
rate determinations which has been used to estimate the reliability 
of their sum in each case. The result indicates some shrinkage in 
reliability when the comparisons are restricted to teachers within 
the same half grade. The original correlation of these two basic 
measures of teaching ability was .625 + .04, when the prophe- 
sied reliability of their sum would be .769 + .03. After under- 
going the treatment necessary to this problem, where only differ- 
ences within each half grade are considered, the same variables show 


2 Kelley, Statistical Method, Formula 158, p. 206. 
* Eugene Shen, Jour. of Educ. Psych., 1924, p. 462. 
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a prophesied reliability of .678 + .04, and .694 +.04, respectively, 
for the slightly different relative array of teachers of arithmetic 
as compared with teachers of reading. When it is remembered 
that only the differences within each rather narrow half-grade 
range are involved, these reliability coefficients would seem to be 
fairly satisfactory. Further analysis was made by partial and 
multiple correlation. 

The final standing of each class in each subject relative to the 
other classes of the same half grade has been considered a criterion. 
The question is: ‘‘ What is the significance of each factor in pre- 
dicting which class will stand highest in final achievement?’’ There 
are four variables: initial standing in each subject, chronological 
age (overageness), intelligence, and teaching ability in each sub- 
ject; each of these is to’be correlated with the criterion. 


RESULTS FOR ARITHMETIC 


Table II shows the intercorrelations of zero order and such 
partial correlations of the 1st, 2nd, and 8rd order as are essential 
to the computation of the multiple correlation between the de- 
pendent variables and the ecriterion.* 

The third-order partials show the residual relationships when 
the other factors have been held constant so far as is possible by 
this mathematical technique. The regression coefficients b12.345, 
b13.245, ete., show the weighting to be attached to each factor in 
order to give the best possible prediction of the criterion. The 
theoretical importance of each factor in this prediction is shown 
somewhat better when we set Sigma 1 equal to Sigma 2 equals 
Sigma 3 equals Sigma 4 equals Sigma 5, and compute the regression 
coefficients. (Obviously, the actual units in which each variable 
was measured are not important.) The result indicates that, so 
far as the unique factors in each variable are concerned, the in- 
telligent class, the initially capable class, the over-age class, and the 
class with a teacher who is rated high tend to have some advantage 
in the struggle for above-average final standing in Arithmetic 
Fundamentals. Intelligence and high initial capacity would seem 
to be of about equal importance when net relationships within a 


“Huffaker, C. L., Jour. of Applied Psych., June, 1923, p. 135. 
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TABLE IJ.—PARTIAL AND MULTIPLE CORRELATIONS OF FINAL ARITHMETIO 
STANDING WITH INITIAL ARITHMETIC, OVERAGENESS, INTELLIGENCE, 
AND TEACHING ABILITY ASSIGNED TO ARITHMETIC CLASSES 


Sigma 1 = 1.894 Sigma 2 = 1.848 Sigma 3 = 1.817 Sigma 4 = 1.792 Sigma 5 = 1.775 


Final Arith, Initial Arith. Overageness Intelligence Teaching Ability 
Ta, = +.412 Ty3 —= —.093 Ty = —+-.333 Ty5 = +.144 
Tag = —.241 To, = +.426 To; = +.130 
T34 = —.626 Tsp = —.200 
4, = +.054 
First-Order Partials Second-Order Partials 
7T12.4=-+.317 112.45 = +.307 
713.2 = +.007 r 13.45 = +.200 
713.4 = +.157 r 14.23 = +.244 
714.2 = -+.191 715.23 = +.104 
115.2 = +.100 r 23.45 = +.071 
715.4 = +.134 r 45.23 = —.164 
r 23.4 = +.037 
r 25.4 = +.119 Third-Order Partials 
T 34.2 = —.597 r 12.453 = +.300 
rT 35.2 = —.211 113.452 = +.188 
T 35.4 = —.259 r 14.235 = +.266 
tT 45.2 = —.002 r 15.234 = +.150 
Ri (23453) =.486 
Sigma 1.2345 = 1.6747 Sigma 4.1235 = 1.2302 
Sigma 2.1345 = 1.5760 Sigma 5.1234 = 1.6792 


Sigma 3.1245 = 1.3344 


b12.345 = .2188  or=.311 if sigmas are set equal 
b13.245 = .2359 or = .226 if sigmas are set equal 
b14.235 = .3621 or =.343 if sigmas are set equal 
b15.234 = .1496 or=.140 if sigmas are set equal 


single semester only are considered. Over-ageness is less important 
than these and a highly-rated teacher is still less important. 
Superficially, it might be argued that the final standing of an 
elementary-school pupil in arithmetic computation is the summa- 
tion of N segments of achievement (where N equals the number 
of semesters of school life) in each of which about the same pro- 
portion might be expected to vary concomitantly with initial score, 
overageness, intelligence, and teacher influence. Hence, it might 
be claimed that intelligence and initial score would always predict 
the major portion of the total arithmetic achievement. Thus, no 
matter how important the sum of N such semesters of teacher influ- 
ence might be, there would be N much larger segments of arith- 
metic achievement paralleling initial standing and intelligence. 
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There are two fallacies in this point of view.® (1) Initial capac- 
ity and probably other variables in this regression equation will 
not maintain their present relative importance throughout the 
period of elementary-school life. (2) The regression equation is 
an instrument of prediction, not a process which coagulates or 
which centrifuges the constituent independent elements of a con- 
glomerate mass. Making allowance for the first type of inaccuracy, 
one is forced to admit that if intelligence includes general potenti- 
ality, there must be a time of zero initial ability in arithmetic com- 
putation. Hence, there must be a time in the early life of the child 
when initial capacity is no indication of his final capacity. Each 
semester’s increment of ability becomes the initial capacity for the 
next semester. These data merely show that by the time classes 
get into the later years of elementary-school life (grades 4-8) what 
they have achieved to date is an important indication of what they 
will achieve in the following semesters. In the same way, it might 
be argued that the importance of relative retardation as a positive 
indication of final achievement is not constant throughout the ele- 
mentary-school period. Presumably, the variable called intelligence 
is fairly constant in its relation to achievement, and it may easily 
be that teacher influence is approximately constant or perhaps 
even greater in the earlier, more plastic years of elementary school- 
ing than in the final years which have been considered here. Thus, 
we cannot safely assume that the relative importance of these fac- 
tors in final achievement during a single semester will be main- 
tained throughout the other semesters of school life. 

Still more important in the correct interpretation of these data 
is the technical nature of the regression equation. It affords an 
analysis of the relative predictive importance of various factors 
with reference to a criterion. The principles which determine the 
predictive importance of each variable can be simply stated. 

(1) The more accurate the measure (reliability), the greater 
its predictive significance. 

(2) The higher the correlation of any measure with the cri- 
terion, the greater its predictive significance. 


°I am indebted to Professor T. L. Kelley for calling my attention to the 
primary importance of these considerations in this particular problem. He 
should not, however, be held responsible for such errors as may have crept into 
my use of them in the following interpretation. 
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(3) The lower the intercorrelations of any measure with all 
the other measures, the greater the predictive significance. 

Thus, it is a sort of average predictive significance, after due 
allowance for each of these considerations, which is indicated by 
the regression coefficient of each variable. We may now attempt 
to estimate roughly what would happen if we had sufficient infor- 
mation to build up a regression equation to predict final achieve- 
ment at the end of 16 semesters of elementary-school instruction. 

There are no very large differences in the accuracy of these 
various measures (Table I). Hence, while all our measures pre- 
dict somewhat better than is apparent from the relationships given 
—partly due to the unreliability of the criterion and partly to the 
chance inaccuracy in the measures themselves—none of them would 
gain notably over the others for this reason. But our measures of 
teacher influence are somewhat less reliable than the others, hence, 
teacher influence would gain the most in true relative importance 
from this standpoint. 

As has been shown previously, the correlation of initial ability 
with the criterion, final standing, would tend toward zero in the 
first semester and probably tend to increase somewhat with each 
additional semester. What is called ‘initial ability’ in these data 
includes all the previous teacher and other influences which have 
been at work. Hence, the average correlation of initial ability with 
final standing for the whole sixteen semesters would probably be 
somewhat smaller than that found for the one semester among 
upper-grade classes. Moreover, a part of the correlation with the 
eriterion which would be lost by initial ability would in all proba- 
bility be gained by the factor, teacher influence. It might be sur- 
mised that the same thing is to some extent true of overageness as 
a factor in final achievement. Presumably, intelligence and teacher 
influence would at least maintain their respective relationships to 
the criterion throughout the sixteen semesters. 

But the application of the third principle will produce very 
important changes. The average intercorrelation of initial standing 
for each of sixteen semesters of school life will be fairly high. So 
also for overageness and intelligence. Hach additional semester 
will add only slightly then to the predictive significance of these 
factors as indications of the total] elementary-school achievement. 
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But except for teachers transferred to a higher grade along with 
their classes and except for the effects of departmental instruction 
where the same teacher had the class through several years, there 
is probably no correlation between teacher influence in successive 
semesters. Each semester of teacher influence is an almost entirely 
unique indication of final achievement. We could expect, then, 
very radical changes in the relative significance of these four vari- 
ables as indications of final standing if we consider the interrela- 
tionships of all sixteen semesters of elementary-school life instead 
of the relationships obtaining during a single semester of instruc- 
tion. In fact, it is conceivable that the cumulative influence of all 
the different elementary-school teachers with which each child or 
class comes into contact would have greater weight in a regression 
equation for the prediction of total final achievement than any 
other of the four factors studied. Even so, it is worth noting that 
this relatively stable characteristic of a child or class which we 
eall ‘intelligence’ would equal in importance the influence of several 
teachers at least. 

It may be, moreover, that the ratings do not identify fully the 
truly able teacher of arithmetic, as measured by the changed rank 
of her pupils in final, as compared with initial, standing. Some- 
thing causes a very marked change in relative class standing be- 
tween the beginning and end of a school semester. The multiple 
correlation R1(2345) = .486 =.05 indicates that only a very mod- 
erate proportion of these factors are involved in the four variables 
considered. The alienation coefficient k, (2345) =.874 indicates that 
there are factors which, if known, might be expected to correlate to 
that extent with final achievement. Thus, in terms of the relation- 
ship r? +k?=1.00, we may say that 76.4 percent of the total 
determining factors in final class standing in arithmetic remain 
undiscovered while these four variables contain 23.6 percent. 

If allowance is made for the reliability of the criterion® the 


result is r,infinit atass .525, in which case 27.6 percent of 


the total determining factors are known and 72.4 percent unknown 
from the variables under consideration. 


* Kelley, Statistical Method, Formula 153a. 
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RESULTS FOR READING 


The intercorrelations and essential partials of the four variables 
with final class standing in reading as a criterion are shown in 
Table IIT. 


TABLE IIT.—PARTIAL AND MULTIPLE CORRELATIONS OF FINAL READING WITH 
INITIAL READING, OVERAGENESS, INTELLIGENCE, AND TEACHING 
ABILITY ASSIGNED TO READING 


— 


Final Reading Initial Reading Overageness Intelligence Teaching Ability 
Sigma 1 = 1.799 Sigma 2 = 1.801 Sigma 3 = 1.817 Sigma 4 = 1.792 pe ea ete 
Ty, = +.831 Ty; = —.525 Yu, = +.714 Ty; — —.050 
To3 = —.617 To = +.772 To5 == —.148 
Ty, = —.626 T35 = +.160 
Tus = —.219 
First-Order Partials Second-Order Partials 
r 12.4 = +.623 r 12.45 = +-.626 
713.2 = —.028 713.45 = —.150 
r13.4 = —.143 714.23 = +.205 
714.2 = +.204 r 15.23 = +.135 
715.2 =+.132 rT 23.45 = —.280 
715.4 = +.155 rt 45.23 = —.147 
rT 23.4 = —.270 
r 25.4 = +.034 Third-Order Partial 
T 34.2 = —.299 rT 12.453 = +.615 
r 35.2 = +.089 r 13.452 = +.034 
r 35.4 = +.030 r 14.235 = +.230 
rT 45.2 = —.166 r 15.234 = +.171 
Ry (cus) = .844 
Sigma 1.2345 = .9643 Sigma 4.1235 = 1.0465 
Sigma 2.1345 = .8663 Sigma 5.1234 = 1.7443 


Sigma 3.1245 = 1.3591 


b12.345 = .6846 or .686 if sigmas are set equal 
b13.245 = .0241 or .024 if siamas are set equal 
b14.235 = .2119 or .211 if sigmas are set equal 
b15.234 = .0945 or .095 if sigmas are set equal 


Although in the zero-order correlations there was a slight nega- 
tive relationship between final achievement in reading and teaching 
ability (1,, = —.049), this was probably due to an apparent tend- 
ency to assign initially poor and less intelligent classes to good 
teachers (r,, = —.148 and r,, = —.219). And indeed, the residual 
correlation of estimated teaching ability with final class standing in 
reading is positive and significant, though somewhat smaller than 
the residual relationship of intelligence to the criterion. Turning 
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to the regression coefficients, it is clear that initial ability com- 
prises by far the most important group of unique factors on which 
to base a prediction of final class standing in reading; that retarda- 
tion as measured by chronological age, though positive, is insig- 
nificant as a factor. Intelligence has a weight of only one-third 
that of initial standing; and teaching ability, a weight of about 
half that given intelligence. The theoretical weights secured by 
setting Sigma 1 equal to Sigma 2 equals Sigma 3 equals Sigma 4 
equals Sigma 5, are practically identical with the others. 

Here, again, the interpretation of the facts is subject to the 
considerations already stated with reference to the previous regres- 
sion equation. Much of the importance which initial standing has 
aS an indication of the next semester’s achievement in reading 
would disappear in an equation to predict total elementary-school 
achievement in reading. Indeed, no small portion of it would prob- 
ably gravitate toward teacher influence. And, on account of the 
much lower intercorrelations of successive semesters of teacher in- 
fluence, that variable would tend to accumulate weight more than 
the others. Hence, here again, teacher influence would probably 
acquire more weight in a regression equation for the prediction of 
all sixteen semesters of achievement in reading than would any of 
the other factors considered. However, it would probably require 
the influence of more than a single teacher to match, in significance 
for final reading achievement, the personal characteristics sub- 
sumed under the head of intelligence. 

In the case of final class standing in Reading Comprehension, 
the multiple correlation is Ri(e345) = .844, and ki 2345) = .536. 
Thus, the four factors under consideration give a much better pre- 
diction of final standing in Reading Comprehension than of final 
standing in Arithmetic Fundamentals. Indeed, the proportion of 
known to unknown factors is almost exactly reversed ;’ 71.2 per- 
cent of the total determining factors in final class standing in 
reading are given by the four variables under consideration, while 
28.8 percent remain undiscovered. If the reliability of the eri- 
terion is taken into consideration, this particular combination of 
four factors would predict a true criterion of final class standing 


esi .971, or 94.3 per- 
V «155 


in reading to the extent of r,infinity = 


‘ Utilizing the relationship r? + k? = 1.00. 
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cent of the total determining factors to 5.7 percent of unknown 
factors. Thus, it is clear that a true measure of class standing after 
a semester of instruction in reading could be almost perfectly pre- 
dicted from a knowledge of initial ability, overageness (C.A.), in- 
telligence, and the teaching ability to whieh the class was exposed. 
We can be sure, therefore, that our ratings have not failed to 
identify most of the teacher qualities which are involved in the 
relative reading achievement of these classes. Even if all the un- 
known factors were simply unrecognized teaching talent, the influ- 
ence of a single upper-grade teacher would remain in general a 
minor factor in reading achievement. In computation ability this 
is not so certain. It is quite conceivable that the unknown factors 
in this sort of arithmetic achievement are in some degree specialized 
characteristics of good arithmetic teachers which are unnoticed in 
such ratings as have been used. 


CoNCLUSION 


In conclusion it might be said that the final standing of an ele- 
mentary-school class in relation to the other classes of the same half 
grade in a representative school system varies directly as the initial 
standing of the class is high or low, as-the class is relatively over- 
age for the grade or underage, as it is intelligent or unintelligent, 
and as it is taught by a teacher who is esteemed by her superiors 
or by one whose competence is not so well attested. Final 
achievement is a complex in which many factors are blended. 
Everything counts, but nothing superlatively. Teachers are not 
miracle workers. In general, pupils and classes are not trans- 
formed in one semester or even in several. But there does seem 
to be a sort of differential pressure varying with the estimated 
ability of the teacher which, quite independently of other factors, 
parallels the variations in final achievement. It effects only very 
slightly the outcome of each semester of instruction, but its unique 
character suggests that the sum total of teacher influence on a 
given child or class would constitute very important data for the 
prediction of total elementary-school achievement. This unique- 
ness (approximate zero correlation between successive semesters or 
years) of teacher influence is probably the reason it is difficult to 
identify as a factor in achievement under ordinary school condi- 
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tions of changing teachers. In general, each child or class in the 
course of an eight-year period is exposed to approximately average 
teaching ability, since there are about as many poor as good teach- 
ers, and the effects of inferior instruction tend to cancel the effects 
of superior teaching in each individual case. Thus ‘‘quality of 
teaching received’’ is a fluctuating possession of a child or class, 
one almost impossible to identify, whereas intelligence and attain- 
ment-to-date, being present each year to about the same degree, are 
increasingly stable possessions which can be readily recognized 
and evaluated. 

Of all the environmental influences by which we hope to im- 
prove humanity we are prone to depend most upon the teacher. 
Is it not significant that, as early as in the upper grades of the 
elementary school, if we consider only such relative achievement 
as actually takes place during one semester of instruction in read- 
ing comprehension, the influence of the teacher affords only a minor 
indication of the final standing in comparison with the influence of 
previous attainment and is far less important than that character- 
istic of each individual child (intelligence) which has attained such 
stability at three years of age as to justify predictions of future 
achievement? Nature and nurture seem to be inextricably inter- 
twined, but he who would change markedly an individual’s relative 
standing in school achievement by such teacher influence as has 
been measured here must not only begin early, but he must also 
insure a continuity of superior instruction. The line between 
potentiality and achievement cannot be clearly drawn. But when 
early achievement becomes so heavily fraught with the promise 
of what final standing will be, and such highly esteemed environ- 
mental factors as the teacher’s influence give an indication of final 
semester standing which is slight in comparison with that afforded 
by measures of the mental organization that has already come to 
pass, are we not forced to think of each new environmental influ- 
ence as a factor of diminishing import in comparison with the indi- 
vidual as he is at any but the very earliest stages of development? 


PREFATORY NOTE TO CHAPTER VI 


The problem of the following chapter is the relative importance 
of intelligence and grade placement as determiners of school 
achievement. Van Wagenen compares (a) groups of boys of the 
same mental age, but two years apart in grade placement with 
(b) boys in the same school grade, but two years apart in mental 
age. He finds a larger disparity of school achievement in the latter 
comparison for all the school abilities tested except history informa- 
tion—a subject more emphasized in the higher grades. In reading, 
the disparity in the second comparison was more than two and 
one-half times as large as in the first comparison, and in the 
‘thought phase’ of history the ratio of the disparities was nearly 
as great. 

It is hardly possible to state the results of this study in 
numerical terms of the relative importance of intelligence and grade 
placement in conditioning achievement—although intelligence obvi- 
ously ranks above grade placement. Neither is it possible to con- 
clude (at least from data in this paper) that intelligence, as it is 
here measured (on the Terman Group Test of Mental Ability), is 
native rather than trained, or partly trained. But the study 
is none the less valuable in pointing which way the wind blows. 
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CHAPTER VI 


GRADE PLACEMENT VERSUS MENTAL AGE AS A 
FACTOR IN SCHOOL ACHIEVEMENT 


M. J. VAN WAGENEN 
University of Minnesota, Minneapolis, Minnesota 


Wherever mental tests have been given in elementary or high- 
school classes, wide variability in the mental age of the pupils has 
been the condition commonly found. It is not unusual to find in 
the same class pupils differing in mental age by as much as four 
years. Nor are the variations in attainment in any phase of school 
work usually discovered to be much less. Yet to many school people 
these conditions seem far less disturbing than wide variations in 
ehronological ages. The latter condition results, of course, when- 
ever gifted pupils are allowed to skip grades or backward pupils 
are required to repeat grades. Where ability grouping is feasible, 
this condition, however, can be largely eliminated. But the fact 
that many backward pupils are still likely to be found in grades 
in advance of their mental ages, while many gifted pupils are dis- 
covered in grades far below where their mental ages would place 
them, raises the question as to the comparative importance of grade 
placement and mental age in school attainment. Gifted pupils not 
infrequently do a poor quality of work in comparison to their 
capacity and the backward, as a rule, surpass them in relation to 
capacity. Is it, then, superior attainment of the less gifted, and 
low attainment of the gifted pupils, that accounts for the wide 
variability in attainment and mental ages in the classes? 

Comparison of the mid-attainments of several hundred boys of 
corresponding mental ages in the sixth and eighth grades of a large 
school system affords a very definite answer to this question, and 
also indicates the relative importance of grade placement in rela- 
tion to attainment in contrast with capacity to achieve as measured 
by mental tests. 
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In Table I are given the mid-scores made in eight achievement 
scales? by all the sixth-grade boys with mental ages ranging from 
12 years, 3 months, to 15 years, 9 months. The same tendencies are 
also evident among the mid-scores of the girls, but the sex differ- 
ences are too great in several instances to warrant combining the 
original scores for comparative purposes. For this reason, the 
scores of only the 333 sixth-grade boys and 486 eighth-grade boys 
have been used as a basis for attacking the problems suggested. To 
obtain more reliable mid-scores, the pupils were sectioned into 
groups each covering a range of three months in mental age, even 
though this procedure slightly minimized the differences among the 
mid-scores. 

A glance down any column in Table I shows a decided increase 
in the mid-scores corresponding to the consecutive mental-age in- 
tervals for every phase of instruction measured. There are rever- 
sions, of course, but these should be expected as a matter of chance 
with so few cases in the consecutive intervals. The general tend- 
ency, however, is too marked to leave any doubt. While some back- 
ward pupils were undoubtedly promoted more rapidly than their 
mental ages would suggest and some gifted pupils were retarded 
because of failure to achieve, this was certainly not true of the 
group asa whole. The groups of pupils with the lower mental ages 
are the groups with the lower mid-scores. Likewise, the higher mid- 
scores were made by the pupils with the higher mental ages. A 
comparison of the mid-scores made by the sixth-grade boys with 
mental ages between 12 years, 3 months, and 12 years, 5 months, 
inclusive, with those made by the boys with mental ages of 15 
years, 3 months, to 15 years, 5 months, inclusive, shows some 
striking differences. In spelling the two groups vary by more 


+The mental tests and educational scales used in making the measurements 
were those of the series that have been rather widely used in recent achieve- 
ment surveys; namely, 
The Terman Group Test of Mental Ability, Examination A. 
Spelling Scale A of the Van Wagenen Spelling Scales. 
The Thorndike-McCall Reading Scale, Form 7. 
The Van Wagenen American History Information Scale R. 
The Van Wagenen American History Thought Scale R. 
The Posey-Van Wagenen Geography Information Seale R. 
The Posey-Van Wagenen Geography Thought Scales R and S. 
The Woody Arithmetic Scale R (Van Wagenen revision) for the funda- 
mental operations. 
The Buckingham Scale for Arithmetic Problems, Form II, with the C 
score method of scoring. 
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TasBLE I.—Muinp-Scorss or 335 Srxta-GrapE Pupits CLASSIFIED BY 
MENTAL-AGE INTERVALS OF THREE-MONTHS 


Geogra- Arith- 


Mental History 5 Geogra- : Arith- 
Ages Spelling | Reading Infor- aaa eee phy ee metic No. 
mation mon Thought mientals Problems 

12_3 48 56 68 67 68 71 61 50 
12_6 48 54.5 69 66.5 67 72 62 56 
12_9 48 55.5 72 69 69 78 66 56 
13_0 53 58.5 75.5 71 71 73 65 37 
13_3 51 57.5 72.5 72 70.5 76 69 33 
13_6 55 63 77 74 13 75 69 23 
13_9 55 61 79 75 74 77 64 21 
14_0 55.5 63.5 79.5 75 76 77 72 19 
14.3 53 64.5 75.5 73 75 82 71 8 
14_6 55 63 83.5 75.5 76.5 92 75 

14_9 59.5 67.5 88 79 76.5 79 72 8 
15_0 61 64.5 81.5 75.5 78 93 82 4 
15_3 59.5 68.5 71.5 77.5 79.5 83.5 74 4 
15_6 61 70.5 88.5 83.5 78.5 82.5 82 4 
15_9 65 70.5 87 79.5 78 83 73 5 


Tasie II.—Mrp-Scorzs or 486 Eraurs-Grape Purits CLAssiFIED BY 
Mentat-Acs INTERVALS OF THREE-MONTHS 


Geogra- Arith- 


Mental : : History History | phy Geogra-| metic | AT ith- 
Ages Spelling | Reading Infor- Thought! Infor- phy Funda- | metic No. 
mation aaa Thought enantals Problems 
12_3 53.5 52 66 72.5 68 65.5 59.5 66 10 
12_6 58 54 66.5 71.5 64.5 67 74 67.5 14 
12_9 57.5 54 74 78 68 71.5 79.5 77.5 22 
13-0 56 54 71 77 69.5 70 78 73 20 
13_3 59 55 72.5 77.5 72.5 70.5 74 74 27 
13_6 57.5 55 71.5 80.5 1h 71.5 76 75 36 
13_9 59.5 57 78 83 74.5 73.5 77.5 70.5 36 
14_0 57 57 73 83.5 72.5 73 82 82 58 
14_3 58.5 59 76 85 74 75 80 77 41 
14_6 59 60 75.5 83 74.5 75.5 81 82 42 
14_9 61 63 75.5 87.5 75.5 75.5 82 83 47 
15_0 60.5 63 76.5 89 76.5 75.5 80 84 39 
15_3 60.5 | 61 10h 89.5 Ut 79 83.5 83 46 
15_6 64 63 Teh 88.5 77 79 86.5 85 24 
15_9 62 65 80 90.5 78.5 81 85.5 87 24 


than a year, and in the thought phase of geography and the funda- 
mental operations of arithmetic, by nearly two years. In the in- 
formation phase of both American history and geography they 
vary by nearly three years, while in reading and thought phase of 
both American history and arithmetic they vary by more than 
three years. Thus, the two grades differed more widely in three 
phases in instruction than in mental age. 
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An inspection of the columns of Table II, which contains similar 
data for the eighth-grade boys of the same school system, reveals 
exactly the same tendencies. In every phase of work measured 
except their ranges of information in American history, there is a 
difference of two grades or more in the mid-scores of the 12 year, 
3 months-old group and those of the 15 year, 3 months-old group. 
Even in the information phase of American history there is a dif- 
ference of nearly two grades, while in the thought phase there is a 
difference of three grades. In abilities to solve arithmetic prob- 
lems the difference between the mid-scores of the two groups is 
even wider than the differences in mental ages. Very clearly, the 
gifted pupils are not held back because of low attainments, but in 
spite of high ones. Likewise, the backward pupils are pushed 
along more rapidly than their attainments would warrant. Both of 
these conditions may be advisable when the pupils are grouped into 
homogeneous classes. But one at once questions whether this tend- 
ency in the normal class may not account for the fact that the rela- 
tive quality of work of the gifted pupils is commonly below the 
average for all pupils and much below that of the backward pupils. 
While this question can be definitely settled only on the basis of 
experimental evidence, further analysis of the data in Tables I 
and II will show the relations of grade placement and mental age 
to school attainment. 

As one compares the corresponding columns in Tables I and II 
for boys of the same mental age, the mid-scores of the eighth-grade 
pupils are for most groups higher than those of the corresponding 
sixth-grade groups. To some extent, of course, this is to be ac- 
counted for by the fact that the pupils with the higher attainments 
for their mental ages are undoubtedly more likely to be promoted 
or allowed to skip grades. How much is due to this factor cannot 
be determined from these measurements. But even if this is re- 
sponsible for only a small amount of the difference, grade place- 
ment cannot be a very important factor in school achievement. 
As one compares the corresponding columns, mid-secore by mid- 
score, the differences are found to be small, except in the case of 
range of information in American history, a phase of instruction 
upon which much emphasis was placed in the eighth grade of this 
school system. In most instances it is the similarities, rather than 
the differences, that are striking. 
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In Table III a qualitative comparison is made between the 
differences in the attainments of sixth- and eighth-grade pupils of 
corresponding mental ages and the differences in attainments of 
pupils of the same grade but differing by two years in mental age. 
Each item is an average of from seven to fifteen differences, and 
hence is a reasonably reliable basis on which to evaluate the 
tendencies. In the case of spelling the average two-year mental- 
age differences in Column III are just as great as the average 
sixth- and eighth-grade differences. In ability to answer questions 
on what is read, as measured by the Thorndike-McCall reading 
Seales, the average two-year mental-age differences are more than 
two and a half times as great as the average difference between 
sixth- and the eighth-grade pupils of the same mental age. The 
ratio is nearly as great in the thought phase of American history. 


Tasxie III].—Comparison oF Two-YEAR AND Two-GRaDE DIFFERENCES 


Average of dif- | Average of dif- | Average of dif- | Average of dif- 
ferences for each | ferences for each | ferences for each | ferences between 
of theseven two- | of theseventwo-]| of the fourteen mid-scores of 
year intervals year intervals | ‘two-year inter- | Grades VI and 
of mental age of mental age vals of mental | VIII for each of 
in Grade VI in GradeVIII age in Grades | the fifteen men- 
i VI and VIII | tal age intervals 


Spelling: 32 ster <\ejs\ase eleiolsitere 4.5 3.5 4.0 3.9 
REACIN GH. 2. ciotyers 5 surala.eh 8.0 hate 7.9 3.0 
History information...... 9.0 5.3 7.2 11.4 
History thought.......... 8.9 10.4 9.7 4.6 
Geography information... VA!) 6.4 6.7 —1.2* 
Geography thought....... 7.0 7.3 Ulges —0.5* 
Arithmetic fundamentals. . 10.4 8.6 9.5 —1.0* 
Arithmetic problems...... 10.4 10.0 10.2 7.6 


*The minus sign indicates that on the average pupils of the eighth grade were below those of 
the same mental age in the sixth grade, even though they had studied geography during th® 
seventh grade and the fundamental operations of arithmetic during both the seventh and eighth 
grades, 

Even in the information phase of American history, where the 
marked emphasis upon American history in the eighth grade would 
lead one to expect much greater grade differences than mental age 
differences, the average grade differences are less than sixty percent 
greater than the average two-year mental-age differences. Although 
the eighth-grade pupils apparently know less about either the in- 
formation or thought phases of geography than do the sixth-grade 
pupils of the same mental ages—in spite of the fact that they 
studied it intensively throughout the seventh grade—the pupils of 
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either grade who differ by no more than two grades in mental age 
differ by more than a grade? in attainments in all four cases. Just 
as in geography, the sixth-grade pupils stood higher in the ability 
to do the fundamental operations of arithmetic than did the eighth- 
grade pupils of the same mental age. The difference between the 
two-year mental-age levels was more than a year and a half in 
attainment. This should not be interpreted to mean that pupils 
do not make any gains in geography and the fundamental opera- 
tions of arithmetic from the sixth grade to the end of the eighth 
grade. They make very nearly normal gains, but in the eighth 
grade the same pupils will on the average be mentally two years 
older than in the sixth grade. It is the difference in mental age, 
rather than the difference in grade placement, that is evidently 
responsible for the differences in attainment. This does not imply 
that gains would not have been made without instruction, but it 
does suggest that mental age is far more important in determining 
school attainment than either the differences in instruction or the 
differences in content from the sixth grade up through the eighth 
grade. The pupil’s intellect, present abilities, and ranges of in- 
formation are evidently far more significant than his grade place- 
ment. It is possible that too high a grade placement in comparison 
with mental age may serve as a stimulus to the less gifted pupil. 
Of this there is no evidence. That too low a grade placement may 
fail to stimulate the gifted pupil is not at all improbable. 

This conclusion may seem open to several criticisms. The 
mental-age intervals for the higher levels may be inaccurate. Even 
if this were true, it would not invalidate the results. It is far 
more likely that the probable errors of the individual scores tend 
to minimize mental-age differences more than inaccuracies of mental 
ages would tend to exaggerate them. [Even the crude mental-test 
scores would demonstrate the tendencies. 

Again, it may be objected that mental age is but another index 
of school achievement. It is possible that school work, and espe- 
cially school attendance, may have some effect upon the mental-test 
scores. Yet, even were this demonstrated, it would not affect the 
validity of these results. Since pupils with the lower attainments 

2In all phases of instruction except spelling and reading a difference of 


6 points represents the normal gain during a grade. In spelling and reading 
the normal gain for a pupil of average intelligence is 4 points. 
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tend to have still lower mental ages and pupils with the high attain- 
ments still higher mental ages, the effect of school work upon mental 
age, apart from the general functioning of intellect, is probably 
slight. In view of the fact that the pupils with the higher mental 
ages in any grade are the youngest chronologically and have 
skipped, rather than repeated, earlier grades, and that the pupils 
of the eighth grade have, in addition, attended school two years 
longer, whatever effect school work may have had in raising the 
mental age of the more gifted pupils of the sixth grade is more 
than offset by their shorter period of attendance at school. 

Still a third criticism may be cited: namely, that the scales used 
in measuring attainment test what the pupils acquire outside of 
school rather than what is acquired in school. This probably could 
not be said of spelling and almost surely not of the fundamental 
operations of arithmetic; yet in the former the gain is the same 
for the difference of two years of mental age as for the difference 
of two school grades. In the case of the fundamental operations 
of arithmetic the average difference for two years of mental age 
is exceeded by only two other average mental age differences, those 
of the thought phase of history and of arithmetic problems; yet 
pupils of the eighth grade have a lower attainment than pupils 
of the sixth grade of the same mental age. If the scales measured 
what is acquired outside of school, rather than within, surely the 
chronologically younger pupils of the sixth grade would not know 
more than pupils of the same mental ages in the eighth grade, 
even though the latter had studied a year longer in school. Even 
in the case of reading ability, the average difference in mental age 
for two years is almost exactly what one would expect as the 
result of two years’ work in school: the difference between the 
sixth and eighth grade mid-scores is only one-half of the normal 
gain that should result from two years’ work. Nor was reading 
at all neglected in the seventh or eighth grade of this school system. 
The evidence in favor of the greater importance of mental age as 
compared with grade placement, even though it cannot be expressed 
with any degree of exactness, nevertheless seems almost unquestion- 
able. This does not mean, however, that the gifted pupil can be 
just as effectively taught in a lower grade as in a group of pupils 
similar in mental age. But it does imply that content of instruc- 
tion should be adjusted to mental age and present abilities. 


PREFATORY NOTE TO CHAPTER VII 


At the request of the Yearbook Committe, Mr. McCall has pre- 
sented in Chapter VII a brief summary of a study that had been 
reported in detail earlier. The reader may consider it as a con- 
tribution along the same general lines as the two preceding chap- 
ters, that is, as an attempt to measure the amount of alteration in 
school achievement that can be expected from alterations in the 
educational environment, particularly in connection with teaching 
or classroom methods. 

In this particular ease the environmental alteration was the 
segregation of bright pupils. Sixty-seven bright pupils who were 
taught for two years in classes with other bright children only 
were compared for achievement with 67 equally bright children 
who were taught for two years in classes with all sorts (psycho- 
logically speaking) of children. 

The outcome points to a definite advantage in segregation, 
although the conditions under which the study was made were not 
such as to permit this conclusion to be established beyond cavil from 
the statistical point of view. 

In the Twenty-Third Yearbook, Part I, in Section II, dealing 
with special studies of gifted children, will be found several re- 
ports of investigations dealing with this question of the effects of 
segregation upon the school attainments of intellectually superior 
children.— Editor. 
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CHAPTER VII 


COMPARISON OF THE EDUCATIONAL PROGRESS OF 
BRIGHT PUPILS IN ACCELERATED AND IN 
REGULAR CLASSES 


Wituiam A. McCaLu 
Teachers College, Columbia University, New York City 


PURPOSE 


This study compares the achievement in standard educational 
tests of 67 bright children who were taught in classes with other 
bright children for two years in a New York City public school 
with 67 unsegregated bright pupils scattered in several other New 
York City schools whose mental ages and chronological ages 
matched those of the segregated pupils, both pupil by pupil and for 
the group as a whole. Thus, both groups were equally bright. 
They differed chiefly in that the pupils in the former group were 
segregated whereas those in the latter group were taught in classes 
with pupils of widely and typically varying brightness, mental age, 
and chronological age. The 67 children in the segregated group 
were about equally distributed over Grades III to VII inclusive, 
and those in the non-segregated group were similarly distributed. 


Tests EMPLOYED 


The test used to measure intelligence was the National Intelli- 
gence Test, Scale A. The educational tests were the Thorndike- 
McCall Reading Scale, the Woody-McCall Mixed Fundamentals in 
Arithmetic, and selected words from the Ayres Spelling Scale. All 
tests were administered at the end of two years of segregation. 
Reading age, arithmetical age, and spelling age for each pupil 
were combined to get the educational age, and this was averaged for 
each group. 

ESSENTIAL RESULTS 


The following represent the essential findings of the investiga- 
tion. The mean educational age for the segregated group was 
145.42. For the non-segregated group it was 138.28. The differ- 
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ence of 7.14 is in favor of segregation. The standard deviation of 
the difference was 4.00, which yields an experimental coefficient 
of 0.62. Interpreted in words, this means that the segregated group 
had gained an excess of 7.14 months in 2 years, and we ean be 62 
percent certain that such segregation is in general more favorable 
to the growth of the traits measured than is non-segregation. 


CoMMENTS 


In an ideal experiment, gain from initial to final measurements 
would be preferable to final status only, but this study grew out of 
records collected for another purpose. Also, in an ideal experiment, 
an effort would be made to control possible differences in teaching 
efficiency, supervisory efficiency, and curricular content. It is pos- 
sible that all three of these variables were favorable to the segre- 
gated group, since the pupils were in an experimental school. The 
amount of influence, if any, of such variables upon the final differ- 
ence is not known. It is improbable that these influences were 
sufficient to explain all the superiority of the segregated group, 
particularly since it was the avowed object of the school to diver- 
sify the curriculum rather than to push the children forward in 
the particular traits measured. 


PREFATORY NOTE TO CHAPTER VIII 


This chapter continues the line of argument of the preceding 
chapters, though directing attention to a different environmental 
‘factor. The question here studied is: Can it be shown that the 
achievement of pupils, measured after twelve years of public-school 
instruction, is measurably superior in school systems that spend 
the most money in operating their schools. The answer is ‘No.’ 

The investigation shows that educational costs, measured as the 
ratio of total current expenditures to average daily attendance, have 
no correlation with the gross amount of school subject matter (as 
measured on the Iowa High-School Content Examination) learned 
during twelve years of schooling. Neither do expenditures show 
any correlation with the composite marks which graduates of the 
school systems in question are able to make during their first 
semester at college. 

If studies carried through in the future corroborate the indi- 
cations of this chapter, it will behoove our schools to take careful 
stock of their aims, their methods, and their results, and possibly 
to formulate new aims. If cheaply run schools are as effective as 
expensive schools, both in teaching pupils prescribed content and 
in preparing them to learn in the future, very important questions 
are raised. Should we spend less money for our schools than we 
now do? Perhaps no educator or social scientist would say we 
should, but nevertheless the school administrator can hardly avoid 
asking himself, when he reads this chapter, just what the educa- 
tional gains are that are supposedly assured by increasing the per 
capita expenditure for the operation of the school. 
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CHAPTER VIII 


EFFICIENCY OF TRAINING AS AFFECTED BY THE 
COST OF INSTRUCTION 


T. C. HoLy 
Ohio State University, Columbus, Ohio 
and : 
G. M. Rucu 
University of California, Berkeley, California 


Tur PROBLEM 


This investigation was directed at the determination of the 
relation between the amount of money spent by school systems per 
pupil in average daily attendance and the effectiveness of in- 
struction. 


Tur Measures USEep 


Four different measures were used in the attempt to evaluate 

this problem, as follows: 

(1) School costs; computed by the conventional formula, Total 
Current Expenditures? divided by Average Daily At- 
tendance 

(2) The Thorndike Intelligence Examination for High-School 
Graduates, Part I, Form B 

(3) The Iowa High-School Content Examination,’ Form A 

(4) Composite marks of high-school graduates earned during 
the first semester of the year 1923-24 at the State University 
of Iowa 


Extracts from a larger study by T. C. Holy entitled: ‘‘Relation Be- 
tween Cost Per A. D, A. and Quality of Scholarship in Iowa Public Schools,’’ 
1924. Unpublished to date. This study was made when both Professor Holy 
and Professor Ruch were connected with the State University of Iowa. 

2¢<Total Current Expenditures’’ was defined as the money expended for 
all purposes other than capital outlay, interest on bonded indebtedness, and 
bond payments. It includes the salaries of superintendents, supervisors, teach- 
ers, janitors, and other help; tuition costs; texts and library books; instruc- 
tional and janitorial supplies; transportation costs; fuel; rent; insurance; 
interest on temporary debt; and certain other items. 

* This is a test with 80 minutes’ working time, containing 400 items cover- 
ing the more important subjects of the high-school curriculum. It shows a 
reliability well above 0.95 for the subjects studied. 
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The school costs were figured for the school years 1920-21 and 
1921-22. Graduates of 26 Iowa high schools in the spring of 1923 
and all freshmen entering the State University of Iowa during the 
fall of 1923 were included. The marks earned in all college sub- 
jects were weighted by the method of standard measures to 
eliminate, as far as possible, differences in departmental standards. 

Both measures of efficiency of instruction (The Iowa High- 
School Content Examination and the composite marks) are ad- 
mittedly but partially satisfactory criteria, but both must be ac- 
corded considerable weight in evaluating the results of public- 
school training. The Iowa High-School Content Examination 
measures directly the effects of high-school instruction and %- 
directly to some extent the effects of elementary instruction. Col- 
lege marks are valid for purposes of this study to the extent to 
which they reflect previous preparation, habits of work, ete. 


Tue PuPits STUDIED 


In all, 1796 pupils were studied, but the present report will be 
confined to pupils who completed all 8 grades of elementary and 
all 4 grades of high-school instruction in the same Iowa school sys- 
tem. The cost per pupil in average daily attendance can be com- 
puted rather exactly for these pupils who were educated through- 
out within a given school system. 


CLASSIFICATION OF SCHOOLS 


Since school costs differ to some extent with the size of the school 
system, five classes of schools are considered, based upon number of 
pupils in average daily attendance, as follows: 


Class I I— 199 
Class II 200 — 399 
Class III 400 — 799 
Class IV 800 — 1699 
Class V 1700 — above 


THE RESULTS 


Table I shows the correlations between the three measures used _ 
for the study. In explanation, the values in the column r,, are 
read: ‘‘the correlation between accomplishment as measured by 
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the Iowa High-School Content Examination and cost per pupil 
in average daily attendance.’? The values in the column 
head r,,, may be read: ‘‘The correlation between accomplishment 
and cost with the influence of intelligence eliminated.’’ 


Tasty I.—CorreLations, with THEIR PROBABLE Errors, BETWEEN THREE 
Measures IN Five Criasses or SCHOOLS 
(Data for high-school examination.) 


Class of 

School Tig T13 Te T 12,3 N 
Lean —.036 + .073 67 + .040 .099 + .071 | —.14 87 
II alow ses 055 66 + .031 |—.016 + .056 | —.19 142 
ieee .067 + .052 67 + .028 17 + .050 | —.079 168 
TV et —.019 + .050 64 + .029 }—.18 + .049 .086 179 
Viger —.076 + .054 77 + .022 |—.055 + .054 .052 159 


The variables in this table are numbered as follows: 
1. Iowa High-School Content Examination. 

2. Cost. 

3. Thorndike Intelligence Examination. 


Table II presents similar coefficients of correlation where the 
first-semester college marks are used as measures of accomplish- 
ment instead of the Iowa High-School Content Examination. 


TaBLE II.—CorRELATIONS, WITH THEIR PROBABLE ERRORS, BETWEEN THREE 
MEASURES IN Five CLassEs OF SCHOOLS 
(Data for college marks.) 


Class of 


School Tis Tis T23 T12.3 N 
079 54 + .056 .017 + .080 |—.086 ql 
+ .064 61 + .044 .040 + .0638 |—.025 112 
+ .068 53 + .050 Sih) se alters) .059 95 
O10) 46 + .056 |—.026 + .071 |—.16 89 
+ .0638 59 + .042 |—.041 + .071 18 110 


The variables in this table are numbered as follows: 
1. Composite first semester college marks. 

2. Cost. 

8. Thorndike Intelligence Examination. 


CONCLUSIONS 


1. Tables I and II show no significant correlations between 
school costs and either of the two measures of effectiveness of in- 
struction, whether zero order or partial correlations eliminating 
intelligence are considered. 
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2. Intelligence, as measured by the Thorndike test, is of far 
more importance in determining the effectiveness of training than 
is cost per pupil in average daily attendance (Table I). 

3. So far as preparation for successful accomplishment in col- 
lege is concerned (whether fundamental training, habits of study, 
or what), the cost of public-school education seems not to influence 
to any important extent attainment in institutions of higher learn- 
ing (Table II). 

4. The criteria of the effectiveness of training, although prob- 
ably rather imperfect measures, would seem to possess enough value 
to permit effects of varying instructional costs to emerge if these 
were highly important influences in achievement. 


TaBueE II].—Tue Means Anp STANDARD DEVIATIONS OF THE MATERIAL USED 


IN TABLE I 
Means Standard Deviations 
Class oe 
LiTEICORl ee Cost Thorndike | I. H.C.E. Cost Thorndike 
eae scraih 174.4 |$115.82 84.6 33.9 |$ 47.80 19.4 
DRA tere eek 174.6 155.05 92.4 46.9 44.20 QAR 
Oe Gales 180.4 100.11 95.5 46.5 18.15 16.9 
CUS er 191.8 77.95 97.0 54.2 14.82 18.7 
ap eaters | 189.9 94.11 99.7 58.5 11.82 DS 


Taste IV.—TuHEe MEANS AND STANDARD DEVIATIONS OF THE MaTERIAL USED 
IN TABLE IT 


Means Standard Deviations 
Class cc 
Marks Cost Thorndike} Marks Cost Thorndike 
err aiker ee ate 12.0 |$130.16 87.1 29.0 |$ 34.40 20.0 
Dad | A kaa 31.0 13%e25 93.0 13.6 BO 21.6 
See aas S280 100.16 101.42 12.8 24.45 mez 
BOM, ahs ots DOD, 82.23 92.6 1275 15.54 19.1 


ON eiekte eee Neri 36.0 89.82 102.9 13.6 11.04 20.3 


PREFATORY NOTE TO CHAPTER IX 


The conclusion of this chapter seems quite clearly established— 
“‘that inferior students study somewhat more hours, [than do 
brighter students] but that this slight increase in duration of study 
does not compensate at all for their lack of ability.’’ The correla- 
tion between the number of hours per week which a group of col- 
lege students spent in studying had a slight negative correlation 
with the grade-points they earned during their first college semester. 
This correlation became zero when proper discount was made for 
their composite intelligence scores on four examinations. 

It may seem surprising that, even for students of equal mental 
ability, time spent in study fails to differentiate them according 
to success in class work. Certainly, we have all known college stu- 
dents who ‘‘flunked’’ because they did not apply themselves, and 
others, not necessarily of the highest mental caliber, who graduated 
with honors by dint of hard work. Possibly the zero partial corre- 
lation can be partly accounted for by one or more of the following 
conditions: 

1. The unreliability of college marks. 

2. Unrepresentativeness of the ‘study time’ measures, owing 
to the fact that the students kept records of study only during one 
week. 

3. The tendency of many students to study only enough to 
‘get by.’ 

4. Differences in difficulty of various courses, thus necessitating 
different amounts of study for different classes to insure the same 
class marks. 

This investigation does not prove that inferior students could 
not compensate their inferior ability by increased time spent in 
study; but it does show that, as a group at least, they do not do it. 
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CHAPTER IX 


ACHIEVEMENT AS AFFECTED BY AMOUNT OF TIME 
SPENT IN STUDY? 


Lonzo JONES 
Graceland College, Lamoni, Iowa 
AND 
G. M. RucH 
University of California, Berkeley, California 


INTRODUCTION 

It is a common assumption that the amount of time spent in 
study is an important factor in determining accomplishment. This 
is very probably true within certain limits if other factors are taken 
into account. One variable in the situation would seem to be the 
fact that the amount of time spent in study is not a very perfect 
measure of the effectiveness of study. Another factor would seem 
to be differences in the abilities of the students—whether these be 
differences in original endowment or in previous achievement. In 
the third place, there are probably definite limits within which 
time spent in study influences learning. 

A practical difficulty in recording time spent in study for ex- 
perimental investigation arises from the fact that much must be 
left to the honesty of the students who keep their study records. 
This limitation is inherent in the data reported here, except that 
two facts guarantee that dishonesty was kept at a minimum, viz.; 
(1) confidence and understanding on the part of the students was 
established by means of preliminary personal interviews and ex- 
planations, and (2) subsidiary investigations showed substantial 
agreements in the records kept by students on themselves and those 
kept for the same students by responsible persons in charge of their 
living groups. 

THE METHOD 

Two groups of college freshmen will be considered here; first, 
130 unselected students, and second, 150 freshmen so selected as to 
provide three groups of 50 each that differed widely in mental 
ability. 


1 This investigation was conducted when both of the authors were connected 
with the State University of Iowa. 
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All these students kept an hour-by-hour time chart for one 
typical school week. A week was chosen that would be free from 
any unusual disturbing factors in study, such as major football or 
other athletic events, a vacation immediately preceding or follow- 
ing, or a final examination period, ete. 

A three-hour mental examination had previously been given to 
all students included in the study. This consisted of four tests: 

Thorndike Intelligence Examination, Part I, Form EB 
Thorndike Intelligence Examination, Part I, Form O 
The Morgan Mental Test 

The Iowa Comprehension Test 


The results of these four tests were weighted and pooled and 
then expressed as percentile ranks for a group of about 1200 State 
University of Iowa freshmen. 


THe RESULTS 


Table I shows the average number of hours spent in study for 
the week selected and the average grade-points earned at the end 
of the first semester of the year 1923-24—the semester which in- 
cluded the week during which the time records were obtained. 


TasiE I.—Time Spent In Stupy AND GrRADE-PoInts EARNED BY THREE 
GROUPS OF FRESHMEN AT lowA 


Average No. Average Grade- 
G Hours Spent Points Earned 
ph) in Study for During the 
One Week Semester 


A. 50 superior men (Top 10 percent in 
the mental examination).......... 20 era ie 46.1 + .99 


B. 50 average men (Between the 43d and 
the 56th percentile in the mental 
EXAMINATION) i= Hacercete eect cat See as ey 23.8 + 1.25 


C. 50 inferior men (Bottom 15 percent 
in the mental examination)....... S455 Sor 7.0 + 1.44 


Table I makes it appear that the dull students study somewhat 
more in terms of hours than do the superior ones; the difference, 
however, is not very marked. There is, of course, the possibility 
that the records were ‘padded,’ but this disturbance would have 
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to be considerably greater for one group than another to change 
significantly the complexion of Table I. The data show much 
wider differences in grades earned by the three ability groups than 
differences in time devoted to studying. In spite of the apparently 
greater studiousness of the less able students, they accomplish far 
less in the judgment of their instructors. Two interpretations are 
possible: (1) the inferior student really does study more be- 
cause of necessity, but even then fails to overcome the handicaps 
of relative lack of ability; or (2) the inferior student is the more 


Tasie I].—Time Spent in Stupy AND Grapz-Pornts Earnep By 130 Un- 
SELECTED FRESHMEN AT lowA 


Grade-Points| Percentile Ranks| Hours Spent in 
Earned on the Mental Study for the 
Examination | Week in Question 


AYVEraves nie iia oes oe 26.2 59) 7 31.4 
Standard Deviation.... 16.2 30.6 9.0 
Ranges scee ary eee — 20 to 59 1 to 99 13 to 58 
he ate between (1) grade-points, (2) percentile rank, and (3) hours of 

study. 

Tr = -69 + .031 T12..3 = .66 

Tis = —.28 + .055 To3,1 = —.32 

Te = —.41 + .049 Tr.2 = .0044 

188 745) .69 


dishonest about his records. It is our opinion, based upon observa- 
tion of these students over a period of a year, that the first inter- 
pretation more nearly pictures the real situation. In many eases 
it was possible to confirm the substantial accuracy of the time 
records submitted. 

Table II shows similar results for 130 unselected university 
freshmen for the same year and class, together with the correla- 
tions between grade-points, ability, and hours spent in study for 
this typical group of freshmen. It will be noted that mental ability 
shows a correlation of .69 with grade-points earned and that hours 
spent in study shows a slight, but significant, negative correlation 
(—.41) with achievement. However, when the factor of mental 
ability is eliminated through partial correlations, the former rela- 
tionship remains constant, while study time disappears as a factor 
operating over and above differences in ability. 
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CoNCLUSION 


If these data are to be taken as typical, the conclusion is forced 
upon us that inferior students study somewhat more hours, but that 
this slight increase in duration of study does not compensate at all 
for their lack of ability. Study time, as voluntarily scheduled by 
the average student, would appear to be a factor in achievement of 
much less importance than ability as measured by the mental exam- 
inations employed. The amount of study seems to be dictated by 
the need of the individual student to maintain a quality of ac- 
complishment which will permit him to remain in college, and this 
need is inversely proportional to his ability. 

To avoid misinterpretation, let us note that these results are 
based upon the study time as voluntarily scheduled by average 
freshman students. It need not be concluded that additional time 
spent in study is futile, but merely that the slight increase that was 
actually reported by the less gifted students is not a decisive factor. 
The problem may be raised, but is not answered here, whether the 
time spent in study by the less able students could be increased to 
such an amount as to produce a marked increase in grade-points 
earned by them. 
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PREFATORY NOTE TO CHAPTER X 


The chapters in this second section, Nos. X, XI, and XIII, deal 
with the relative influence of heredity and environment upon cer- 
tain special traits, in contrast with the preceding chapters which 
were concerned with the relative influence of heredity and environ- 
ment upon general school performance. In addition to these three 
chapters, the reader may consult Chapter XV for a summary of 
the literature dealing with the relation of heredity and environ- 
ment to another special trait, musicality. 

Comparatively few studies have been published dealing with 
this influence of nature and nurture upon specific mental traits. 
Anderson’s chapter on mechanical ability in relation to environ- 
ment is, we believe, the first study of its kind. 

The results of his study indicate that little or no influence can. 
be attributed to the mechanical activities of a boy’s father, the tools 
which the boy’s father owns, or the cultural status of the home, 
upon the boy’s mechanical ability when this is judged by ratings 
upon actual projects made in school shops, tests of ‘shop informa- 
tion’ at the close of shop courses, and scores on eight standard 
mechanical tests. The absence of any measurable influence from 
these environmental factors leads to the surmise that the undoubted 
differences that exist in mechanical ability are mainly innate. 
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CHAPTER X 


THE RELATIONSHIPS OF CERTAIN ENVIRONMENTAL 
FACTORS TO MEASURES OF MECHANICAL ABILITY? 


L. DEWEY ANDERSON 
University of Minnesota, Minneapolis, Minnesota 


One of the problems in a research dealing with mechanical 
ability which has been in progress since 1923 at the University of 
Minnesota was to determine the relationship of environmental con- 
ditions to mechanical ability.. In the effort to get an inventory of 
the former the particular-item and the rating-scale methods were 
studied to determine their relative merits for this purpose. It was 
finally decided to use the particular-item method, but to have 
enough items so that scores might be secured for different specific 
aspects of the environment. 

A preliminary schedule of 500 questions was first made and 
used in interviewing the parents of ten boys. The social worker 
who made the calls, found that many parents were reluctant to 
give certain information and also that the mother, who was usually 
interviewed, did not know the names of the mechanical tools owned 
by her husband or son. In order to remedy these difficulties, the 
questionnaire was shortened and divided into two parts—Part I 
(Sections I, II, III, IV, and V) to be filled out by the social worker 
on her call, and Part II (Sections V and VII) to be filled out and 
mailed by the parents. The questionnaire was so arranged that 
items dealing with given aspects of the environment were grouped 
in a section by themselves. The answers were recorded by inserting 
in the parenthesis to the left of the items a ‘‘1’’ for possession (or 
an affirmative answer) and ‘‘0’’ for non-possession (or a negative 
answer). 

A trained social worker? secured the information in Part I by 
visiting the homes and questioning the parents of the boys in the 


1This study was a part of an investigation of mechanical ability directed 
by R. M. Elliot and D. G. Paterson, of the University of Minnesota. The in- 
vestigation was sponsored by the Committee on Human Migrations of the 
National Research Council. Funds were supplied by the Laura Spelman Rocke- 
feller Foundation. 

2Mrs. G. B. Braithwaite, trained in social work at Simmons College, with 
two years of experience as visiting teacher in the Rochester, New York, public 
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experimental group that was next studied. This experimental 
group consisted of 150 boys enrolled at the Jordan Junior High 
School, Minneapolis, for whom we had scores in eight mechanical 
ability tests and reliable objective indices of success in mechanical 
shop courses. 

In interviewing the mothers of the ten boys in the preliminary, 
or trial, group, the social worker felt that the progress of the inter- 
views was hampered because she did not know the boys. In order 
that she might become acquainted with the boys, she called each 
boy out of the classroom for the ostensible purpose of giving him a 
short psychological test. At this meeting she explained to him that 
she would visit his mother within a few days. These acquaintance- 
ships proved very valuable, as they gave the field worker personal 
‘leads’ for conversation with the mother when it was evident that 
a boy was not interested in mechanical things. At the close of the 
interview the social worker left the second part of the question- 
naire with the mother, and asked her to fill it out and mail it. The 
time of the individual call varied from twenty minutes to three 
hours. Before the social worker made a call, the principal of the 
school in which the boys were enrolled sent the following letter to 
the parents. 


WETS ATCT Sa. cites ott cents a. cfs ciciec etd oes 3 
Minneapolis, Minnesota 
Dear Nirvandi Mires. Neqisss lets cis ciocte chore cies 

Ibastaviecatry OULrSOlnepine marca ciecics cickee setts took some tests in the 
Jordan School for the purpose of determining his mechanical ability and 
of aiding us more definitely to arrange courses of study to meet the 
needs of the boys of the school. In order to have more information 
LIpOMaw hich tor WoLkme Nissen cacao cere will visit your home in 
the next few days, asking for some information regarding your boy and 
his out-of-school activities. 

We will appreciate it very much if you will give Mrs. ............. 
as much assistance as possible in securing the information that we need. 
Any information given will, of course, be confidential in our school 
office; so you need not hesitate to talk very freely with her. I thank 
you for this codperation. 

Sincerely, 


BUCO RUUD GO DO DOD COME pean OG SOsor Principal 
Jordan Junior High School 
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It is probable that the codperation of the principal in sending 
the letters and in giving a talk to the boys was mainly responsible 


for the excellent results. 


The parents of 143 of the 150 boys were 


visited, and 185 of the 143 families returned the second part of the 


questionnaire. 


After the data were collected, the items were arranged in seven 
classifications, and then the number of affirmative responses was 
added to get a score for each. The number of items in each classi- 
fication is given in the last column of Table I. 

The items in each section of the questionnaire are given below. 
Preceding each item is given the percent possessing (or the percent 


of affirmative responses). 


QUESTIONNAIRE ON CULTURAL AND MECHANICAL 
ENVIRONMENT 


(Rearranged according to classifications used in statistical analysis) 


The percent of families ‘possessing’ an item is placed in the left hand 


column. 
possession, 


Items are arranged in order according to the frequency of 


I. CuLtursL STATUS OF FAMILY 


Percent 
Possessing Name of Item 


(97) Lighting—electric 

(96) Electric iron 

(95) Sanitation—toilet 

(95) Bath tub 

(75) Homes having six rooms or 
more 

(72) Furnace 

(69) Radio 

(68) Floor coverings—polished, 
expensive rug 

(64) Kitchen floor—painted, lino- 
leum 

(63) Phonograph 

(63) Telephone 

(61) Hot water system 

(57) Automobile other than truck 
owned by the family 

(57) Piano—upright, grand 


Percent 

Possessing Name of Item 

(54) Living room appraisal—ex- 
cellent or good 

(53) Lamps—reading or floor 

(45) Electric washer 

(44) Electric vacuum sweeper 

(44) Dining room appraisal—ex- 
cellent or good 

(35) Electric toaster 

(21) Purchased radio- 

(13) Electric sewing machine 

( 5) Typewriter 

( 1) Electric percolator 

( 1) Radio enclosed in cabinet 

( 0) Electric stove 

( 0) Artificial refrigeration 

( 0) Fireless cooker 

( 0) Steam pressure cooker 


140 THE TWENTY-SEVENTH YEARBOOK 


II. Literary INTERESTS 


Percent 

Possessing Name of Item 

(88) Boy has library card 

(56) 50 or more books in home 

(45) Number of magazines sub- 
scribed for—7 or more 

(45) Number of newspapers sub- 
scribed for—2 or more 


Percent 
Possessing Name of Item 


(41) Boy reads 6 or more books a 


month 

(32) Education of father—above 
8th grade 

(30) Education of mother—above 
8th grade 


III. RecrEATIONAL INTERESTS OF Boy 


(93) Can spin tops 

(91) Can ice skate 

(91) Can play marbles 

(87) Can fly kite 

(85) Can swim 

(83) Can ride a bicycle 

(80) Can play football 

(79) Can play baseball 

(71) Can walk on stilts 

(68) Takes care of pets 

(68) Wanted a tool chest after 
taking manual training 


(64) Has had a “Meccano” or 
“Erector” set 

(58) Knows how to row a boat 

(57) Wanted a tool chest before 
taking manual training 

(57) Knows how to play basket- 
ball 

(53) Knows how to play croquet 

(50) Knows how to wrestle 

(48) Knows how to play shinny 

(23) Knows how to roller skate 

( 1) Knows how to play pool 

( 1) Knows how to bowl 


IV. MECHANICAL OPERATIONS OF FATHER 


(83) Put on screens 

(69) Replaced washers in faucets 

(62) Replaced electrical fuses 

(61) Painted houses 

(60) Painted or varnished wood- 
work or floors 

(59) Puttied around windows 

(59) Fixed window shades 

(59) Put in window panes 

(56) Tightened springs in window 
shades 

(54) Replaced light sockets 

(53) Hung window blinds, screen 
doors or storm doors 

(52) Repaired electrical circuits 

(51) Repaired automobile engines 

(51) Put up shelves 


(50) Sharpened scissors and 
knives 

(50) Repaired plumbing 

(43) Soldered 

(42) Cleaned out flues in stoves 
or furnaces 

(37) Planed doors to fit 

(35) Sharpened carpenter tools 

(33) Fixed leaks in kettles 

(31) Half-soled shoes 

(26) Shingled or repaired a roof 

(24) Made a radio set 

(23) Riveted straps 

(23) Sharpened saws 

(21) Mended screens 

(21) Built a garage 

(19) Made an ironing board 
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Percent 
Possessing Name of Item 


(18) Made a drain board 

(18) Repaired sewing machine 

(16) Made a chest of drawers 

(16) Replaced screening in doors 
or screens 

(15) Repaired furniture 

(15) Tinted or kalsomined walls 

(15) Repaired vacuum sweeper 

(14) Sharpened lawn mowers 

(13) Made trellises for vines 

(13) Built coal sheds 

(13) Made a kitchen cabinet 

(12) Papered walls 

(11) Threaded pipes 


Percent 
Possessing Name of Item 


(11) Made a bread board 

(11) Put down carpet or oilcloth 

(10) Built chicken coops 

(10) Built chicken roosts 

(10) Put pole in clothes closet 

( 8) Mended door stoops 

( 7) Made a footstool 

6) Made a book case 

5) Made a taboret 

5) Made a cedar chest or box 

2) Made bookends or a book- 
rack 

2) Made a necktie rack 

1) Put in water backs in stove 


V. MECHANICAL OPERATIONS OF SON 


(68) Put on screens 

(59) Replaced fuses 

(53) Made a radio set 

(52) Painted or varnished wood- 
work or floors 

(52) Repaired engine in automo- 
bile 

(52) Repaired light sockets 

(49) Repaired electrical circuits 

(42) Painted houses 

(35) Replaced washers in faucets 

(33) Soldered 

(33) Sharpened scissors and 
knives 

(30) Put up shelves 

(28) Cleaned out flues in stoves 
or furnaces 

(28) Made a necktie rack 

(28) Fixed leaks in kettles 

(22) Sharpened carpenter tools 

(22) Puttied around windows 

(22) Riveted straps 

(21) Tightened springs in window 
shades 

(14) Repaired sewing machines 

(13) Hung window blinds, 
screens, or stormdoors 


(12) Repaired plumbing 

(11) Mended screens 

(10) Made a footstool 

(10) Shingled or repaired a roof 

( 9) Repaired vacuum sweepers 

( 8) Built a garage 

( 8) Made book ends or book- 
racks 

( 8) Replaced screen wire in 
doors or screens 

( 8) Repaired furniture 

( 7) Built coal sheds 

( 7) Put down carpets or oilcloth 

( 7) Tinted or kalsomined walls 

( 7) Made trellises for vines 

) Put pole in clothes closet 

) Planed doors down to fit 

Made bread boards 

) Half-soled shoes 

) Built chicken roosts 

) Mended door stoops 

) Made a taboret 

) Built chicken coops 

) Made a cedar chest or box 

) Threaded pipes 

) Sharpened lawn mowers 

) Papered walls 
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Percent Percent 

Possessing Name of Item Possessing Name of Item 

( 2) Made a kitchen cabinet ( 1) Made a chest of drawers 

( 2) Sharpened saws ( 1) Made drain boards 

( 2) Made an ironing board ( 1) Made a bookcase 
VI. Toots Ownep By FATHER 

(89) Screwdriver (41) Vise 

(88) Pliers (41) Alligator wrench 

(86) Ax (41) Compass 

(84) Claw hammer (41) Jack plane 

(83) File (39) Emery wheel 

(83) Putty-knife (38) Miter square 

(80) Monkey wrench (37) “S” wrench 

(76) Chisels (33) Yankee drill 

(75) Carpenter’s rule (31) Smoothing plane 

(71) Whetstone (31) Shoemaker’s hammer 

(69) Pipe wrench (31) Countersink bit 

(67) Brace (29) Saw set 

(67) Ordinary hatchet (28) Marking gauge 

(66) Glass cutter (28) Drawknife 

(66) Buck saw (27) Miter box for saw 

(64) Cross cut saw (27) Outside calipers 

(57) Plain soldering iron (25) Iron clamps 

(56) Level (25) Shoemaker’s nippers 

(55) Nail punches (21) Cabinet scraper 

(54) Small square (21) Ratchet drill 

(54) Block plane (20) Blow torch 

(54) Tinner’s snips (20) Inside calipers 

(53) Machinist’s hammer (20) Shingling hammer 

(53) Awl (19) Riveting hammer 

(50) Steel square (19) Reamer 

(48) Shoemaker’s stand (19) Rivet set 

(48) Rip saw (19) Plumbing force pump 

(47) Hack saw (18) Coping saw 

(44) Work bench (15) Grooving plane 

(44) Keyhole saw (13) Wood clamp 

(44) Leather punch ( 8) Electric soldering iron 

(43) Dividers ( 8) Threading machine 

VII. Toots Ownep sy Son 

(46) Pliers (30) File 

(40) Screwdriver (25) Hatchet 

(35) Claw hammer (25) Chisel 


(31) Compass (25) Axe 
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Percent 

Possessing Name of Item 
(22) Carpenter rule 

(21) Monkey wrench 

(18) Soldering iron (plain) 


Percent 
Possessing Name of Item 


(5) 
( 5) 


Alligator wrench 
Marking gauge 
Tron clamps 


(17) Cross-cut saw ( 4) Hack saw 

(16) Glass cutter ( 4) Miter box for saw 
(15) Bits ( 3) Riveting hammer 
(15) Whetstone ( 3) Cabinet scraper 

(15) Brace ( 3) Shingling hatchet 
(12) Nail punch ( 3) Shoemaker’s peg snippers 
(10) Block plane ( 3) Shoemaker’s stand 

( 9) Vise ( 3) Wood clamp 

( 9) Machinist hammer ( 2) “S” wrench 

(9) Awl ( 2) Saw set 

( 9) Buck saw ( 2) Shoemaker’s hammer 
( 9) Rip saw ( 2) Smoothing plane 

( 9) Small square ( 2) Inside calipers 

( 9) Steel square ( 2) Outside calipers 

( 9) Pipe wrench ( 2) Plumbing force pump or cup 
( 7) Dividers ( 2) Electric soldering iron 
( 7) Leather punch ( 2) Yankee drill 

( 7) Tinner’s snips ( 1) Blow torch 

( 5) Drawknife ( 1) Ratchet drill 

( 5) Emery wheel ( 1) Rivet set 

(5) Level ( 0) Grooving plane 

( 5) Jack plane ( 0) Reamer 

( 5) Coping saw ( 0) Miter square 

(5) Keyhole saw ( 0) Threading machine 


The reliability coefficients (see Table I) of the seven classifica- 


tions were secured by correlating the sum of affirmative responses 
in the odd-numbered items with the sum on the even-numbered 
items and applying the Brown-Spearman reliability formula. 

Only one of the seven coefficients, Literary Interests, which 
contained only seven items, was so unreliable (.47) as to make its 
use questionable in further statistical treatment. The reliabilities 
for the other six classifications indicate that relatively the same 
index in each aspect of a home would be obtainable from a second 
independent set of items. 

One of the most important questions concerning the use of 
environmental questionnaires is: ‘‘Are the variabilities of the 
scores large enough to insure the heterogeneity of the group in these 
aspects ?’’ 
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This question of variability is answered in part by an inspection 
of the distributions of the scores which are shown graphically in 
Figure I. 
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Figure I.—DISTRIBUTION OF SCORES IN SEVEN ENVIRONMENTAL MEASURES 


The distributions are fairly normal in all but three sections, 
Son’s Mechanical Operations, Father’s Mechanical Operations, and 
Tools Owned by Son. In all cases except Tools Owned by Son, the 
scores are distributed throughout almost the entire possible range. 
For example, in Cultural Status the actual range is 2-26 and the 
possible range is 0-30. These two facts, wide range and normal 
distribution of scores within the ranges, indicate that the test group 
was a fairly heterogeneous one in respect to all the environmental 
aspects except Son’s Mechanical Operations and Tools Owned by 
Son. 

The intercorrelations of these environmental measures are given 
in Table I. If we consider only the classifications relating to the 
mechanical environment, the average intercorrelation is .24. The 
measures are somewhat related to each other—particularly is Me- 
chanical Operations of Father related to the other items. If we 
eliminate Tools Owned by Son, we get an average intercorrelation 
of .46 for three other measures of mechanical environment—Me- 
chanical Operations of Father, Mechanical Operations of Son, Tools 
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Owned by Father. The average intercorrelation of Cultural Status, 
Literary Interests, and Recreational Interests (economic and cul- 
tural factors) with the mechanical classifications (.10) indicates 
that more than one phase of environment is being measured by our 
questionnaire. The average intercorrelation of the three non- 
mechanical (7.¢e., economic and cultural) aspects of environment is 
.25. This indicates a somewhat higher inter-relationship among 
these non-mechanical phases of the environment than exists between 
the non-mechanical and the mechanical phases. 

These intercorrelations of economic and cultural status are not 
high enough to indicate that the non-mechanical classifications are 
measuring just one aspect of the environment. They only indicate 
that the non-mechanical classifications are measuring something of 
a different nature than the mechanical classifications and that the 
mechanical classifications are in turn measuring related, but rather 
specific, factors which we concluded from their apparent nature 
to be measures of mechanical environment. 

In analyzing this table we found some evidence that the pos- 
session of material wealth, as indicated by Cultural Status and 
Literary Interests, was somewhat detrimental to mechanical opera- 
tions and the possession of tools. A family with fair economic 
means probably hires:somebody to do much of the repairing and 
odd jobs about the house. Therefore, it is not necessary for the 
father and son to do this work and consequently the son has more 
time left to read and play. The positive correlations (.22) between 
Recreational Interests and Cultural Status and (.33) between 
Literary Interests and Cultural Status and the lack of positive rela- 
tionships between Cultural Status and the several measures of 
the mechanical environment indicate the probable truth of this 
statement. 

To determine the relation of these environmental factors to 
mechanical ability, the seven aspects of the environment were corre- 
lated with two mechanical shop criteria, or scores in mechanical 
success. The shop success scores indicated (1) success in manipula- 
tion of tools and materials, and (2) amount of information acquired 
about tools and materials. 

In obtaining the measures of manipulative skill, the projects 
made by the boys were subjected to objective measurement on many 
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well-defined and specific parts. These measurements were com- 
bined to give a general measure of manipulative success in the type 
of shop work covered by junior-high-school courses in sheet metal, 
woodwork, electricity, printing, and mechanical drawing. The 
amount of information acquired in each of these shops was meas- 
ured by multiple-choice information tests given at the end of the 
course, the scores of which were combined to give a general measure 
of ‘shop information.’ The correlations between thesenvironmental 
aspects and the shop-success measures are given in Table II. 


Taste I].—Tue CorreLations oF THE SEVEN ENVIRONMENTAL FACTORS WITH 
THE MBmASURES OF MANIPULATIVE AND OF INFORMATIONAL 
STANDINGS IN MrecHanicat Lings (100 Casss) 


Correlation with | Correlation with 


Factor Manipulative Informational 
Criterion Criterion 
Cultural status of family................ — .07 01 
Literary interests of son................. .13 .22 
Recreational interests of son............. .O1 712 
Mechanical operations of father.......... .09 24 
Mechanical operations of son............ .30 .35 
oolstownedsby-father eer teeinstictee se .O1 -1l 


Loolsiownedibyison are went eae .10 — .02 


All of these correlations are low. The measures of mechanical 
environment show slightly higher relationship with mechanical 
ability than do those of non-mechanical environment, but are too 
low to warrant the conclusion that mechanical shop success is sub- 
stantially related to those aspects of the environment measured by 
our questionnaire. 

The correlations of the mechanical test scores on the individual 
tests of our mechanical prognosis battery also show low relation- 
ships with the measured environmental factor. These are given in 
Table IIT. 

The low relationships in Tables II and III are not caused by 
low reliability of the criteria or the prognosis tests, since these all 
have high reliability. The reliability of the manipulative criterion 
is .76. We do not know the reliability of the informational eri- 
terion, but the reliabilities of the separate tests composing it range 
from .71 to .84. The reliabilities of the predictive tests are some- 
what higher (see Table III). 
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There is some question concerning the effect of the form of the 
distributions of the three factors, Tools Owned by Son, Father’s 
Mechanical Operations, and Son’s Mechanical Operations, on the 
size of correlations with the scores of shop success. The correlations 
of Son’s Mechanical Operations with the Manipulation Criterion 
and the Information Criterion were, respectively, .30 and .85. An 
inspection of the correlation data sheets did not show that an 
appreciable increase in these correlations would result if the forms 
of the distributions were corrected. A more exact method might 
raise the figures somewhat, but it is doubtful if there would be very 
marked increases. 

The correlations of the environmental classifications and the 
Otis Group Test (Higher Examination, Form A) intelligence quo- 
tients are in the last column of Table III. Literary Interests of 
the Son shows a comparatively high correlation (.86) with the I.Q., 
whereas the mechanical classifications show very low correlations 
with it; two of them are positive and two negative. 

Our results, then, indicate that there is practically no corre- 
spondence between most of the environmental factors we studied 
and either shop success on the one hand or scores in mechanical 
prognosis tests on the other hand. Certain tendencies in the corre- 
lations indicate that an environment in which the boy does many 
mechanical things is better for the development of mechanical 
ability, so far as success in shop work is concerned, than. an en- 
vironment in which the father does many things. This last kind of 
environment is more conducive to the boy’s securing a larger 
amount of mechanical wnformation in shop courses. The correla- 
tions which substantiate these statements are: 


Father’s Mechanieal Operations and the Manipulative 


Oritericiveeemss eae reer teats so sis weir le byert eta glee .09 
Father’s Mechanical Operations and the Information 
GrICeLLOM este mer eta, Leena sion oa asa see een es 24 


Tools Owned by Father and the Manipulative Criterion .01 

Tools Owned by Father and the Information Criterion .11 

A high cultural (economic) status of the family is not con- 
ducive to the ownership of tools by either father or son, or to the 
doing of many mechanical operations by either. The mechanical 
‘opportunities’ of boys tend, then, to be restricted by either a large 
family income or by unwise budgeting of family expenditures. 
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SuMMARY OF RESULTS 


1. Environments of boys in a junior high school are vastly 
dissimilar in regard to cultural, literary, and mechanical oppor- 
tunities, attitudes toward mechanical work, and family wealth. 
The phrase ‘‘equality of opportunity’’ is, generally speaking, a 
meaningless term. 

2. The intercorrelations of seven aspects of the environment 
were low. Intercorrelations within the three non-mechanical aspects 
on the one hand and within the four mechanical aspects on the 
other hand, were higher than the intercorrelations between these 
two different sections. 

3. In general, possession of economic wealth does not tend to 
increase the amount of a boy’s mechanical environment. 

4. The correlations between the seven aspects of environment 
both with measures of shop success and with scores in mechanical 
tests were low. 

5. An environment in which the father does many mechanical 
things was more related to the information that the boy secures in 
the shop courses than to his manipulative skill. 

6. The method of procedure followed in the research did not 
permit decisions concerning causal relationships. We do not know 
whether the doing of mechanical things about the house makes a 
boy more mechanical or whether a boy with mechanical ability will 
do more things about the house. 

7. If it is safe to assume that what we did not measure by the 
questionnaire is inherited, then our data lead to the inference that 
mechanical ability is in large measure an inherited ability. A con- 
clusive investigation of the genesis of mechanical ability is needed 
—one that shall take full account of the factors and the changes 
in factors covered by our questionnaire, also the amount of me- 
chanical practice of the pupils as well as their attitudes towards 
mechanical work. 


PREFATORY NOTE TO CHAPTER XI 


This chapter deals with factors conditioning a special trait, 
handedness. Owing to the small number of cases thus far studied, 
Miss Downey’s results are, of course, to be taken as suggestive, 
rather than conclusive. 

The inference is drawn that dextrality types are probably in- 
born, since they are very definite in form, appear with different 
frequencies in the two sexes, and are manifest very early in life. 
Forty-nine children of age 2-9 to 5-10 tested for this investigation 
showed exactly the same dextrality types that adults do, although 
‘‘a slight percentage of unstable reactions was found in the case 
of the children.’’ 

Evidence is also presented which shows that the mental age at 
which a child can distinguish between left and right (Binet six-year 
level test) may possibly be dependent in part ‘‘upon the dextrality 
type and the harmony or lack of harmony between hand and eye.”’ 

A third type of data deals with the coaching of nine children 
between mental ages of 4-10 and 5-10 who had failed the left-right 
test of the Binet. Four of these learned the distinction perfectly, 
and three of the remaining five approached ‘‘the borderline of 
complete success. ”’ 
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CHAPTER XI 
DEXTRALITY TYPES AND THE PRESCHOOL CHILD 


JUNE E. DOWNEY 
University of Wyoming, Laramie, Wyoming 


TYPES OF DEXTRALITY 


Investigation of the handedness of adults carried on under 
financial aid from the Committee on Scientific Problems of Human 
Migration of the National Research Counsel, has revealed the extra- 
ordinary fruitfulness of a classification of individuals according to 
the types of dextrality suggested by Rife in the Psychological Re- 
view, 1922. Bimanual activities, such as batting and sweeping, as 
well as unimanual ones, such as throwing, are utilized in classifica- 
tion. In bimanual activities, the hand closer to the functioning end 
of the implement used is considered the preferred hand. One is 
able to establish an individual’s handedness formula by noting his 
preferred hand in such unimanual activities as throwing, writing, 
eating, and the like, and in such bimanual activities as batting and 
sweeping. The six main dextrality types are the following: RRR, 
in which the right hand is the preferred hand in both unimanual 
and bimanual activities; RLL, in which the right hand is the pre- 
ferred hand for unimanual operations but the left hand for bi- 
manual ones; RRL, a mixed type, in which the right hand is 
preferred in unimanual activities but there is a mixed preference 
in bimanual operations, in that the right hand is dominant in bi- 
manual activities of a high grade (batting) and the left hand in 
bimanual operations low in the scale (sweeping). The RRR indi- 
vidual throws, bats, and sweeps right-handed. The RLL individual 
throws right-handed, bats and sweeps left-handed. The RRL indi- 
vidual throws and bats right-handed, but sweeps left-handed. LULL, 
LRR, and LLR are left-handed types corresponding to the right- 
handed types just described. 

It is possible, also, to describe many subclasses in a way that 
space here does not permit. Reference, however, must be made to 
the fixating eye, which is of primary importance. Handedness 
formulas, taken in connection with the fixating eye, give groups 
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which are apparently characterized by psychic traits of considerable 
significance. Results obtained from classification of some 1500 
adults, with tentative conclusions that open up a wide field of 
work, have appeared in the American Journal of Psychology, 38: 
1927, 317-367. 


INHERITANCE OF DEXTRALITY TYPE 


The question whether dextrality types point to native differences 
or are the product of the social environment cannot be answered 
at present, although the following facts suggest the likelihood of 
these types having a native, rather than a cultural, origin: 

(1) The exact correspondence existing between RRR, RLL, 
RRL, and LLL, LRR and LLR types; 

(2) The failure to find an RLR or LRL type, although a few 
individuals do give this combination ; 

(8) The existence of sex differences in the frequency with which 
certain types occur and also differences in the relative frequency 
with which certain types occur in superior and inferior groups; 

(4) The possibility that structural differences (bone-measure- 
ments, and degree of finger-print symmetry for corresponding fin- 
gers of the two hands) are correlated with dextrality types. 


DEXTRALITY TYPES IN THE PRESCHOOL CHILD 


In the endeavor to carry our range of information one step 
farther, forty-nine children of preschool age (2-9 to 5-10) were 
put through the tests previously tried on adults.1 Exactly the 
same dextrality types were found for these children as for adults, 
although it is true a slightly larger percentage of unstable re- 
actions was found in the ease of the children. ‘‘Unstable reac- 
tions’’ means that variations occur in the positions of the hands in 
batting or sweeping or that the unimanual preference is only a mild 
one. It is possible, of course, that even the young child is influ- 
enced by imitation in his use of the broom or bat, or that the re- 
action is conditioned by pure chance. But since these young chil- 


1Much of the experimental work for this study was carried on by Miss 
Caroline Wahlquist. We owe the opportunity to test the children to the kind- 
ness of Mrs. Ninna Hunton, Principal of Ivinson Hall, Miss Anne Winne, 
teacher of the kindergarten class, and Miss Ruth Adsit, Supervisor of the Uni- 
versity Grade School. 
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dren are obviously not practiced in such bimanual activities as bat- 
ting and sweeping (for many of them, these performances proved 
entirely novel), it is difficult to believe that such high stabilization 
as is found in many children is due to anything other than native 
organization. Little girls, for example, who have previously never 
handled a broom may be able to manipulate it easily only with one 
relative position of the two hands—with the right below the left 
or the reverse. To attribute the extraordinary awkwardness shown 
in the other relative position to imitation of an absent mother or to 
a chance conditioning of some sort, is not convincing. In cases of 
mild bimanual preference, imitation is no doubt capable of estab- 
lishing a type of reaction. 

The relative frequency with which each handedness formula 
occurs in the case of children cannot be determined from the small 
group with whom we worked, but the figures are given since they 
are of interest in themselves and are, perhaps, indicative of the 
results that might be expected from a larger group—namely, that 
the higher percentage of RRL formulas for women than for men 
would be duplicated in the results for boys and girls. 


Handedness Formula Scare of Se of 
IBASH teria tel | DI bAG.o!5, 6 ee cea Gore if 6 
jae bl Yicthave LS eee eee Cee 6 6 
RR Leandeiulik eres cece on etcas oteittacio cst 6 12 
Winstab loser stiserc cee wir te inte alee seis 3 3 
Lota eA cake een Penne ee 22 Dae 


Lert-HYEDNESS 


Approximately twenty to thirty percent of adults are left-eyed 
in sighting. The percentage is slightly larger in women than in 
men. It is a difficult matter to determine the sighting eye of a child. 
In the present investigation, the young subjects were asked to sight 
a colored disk through a small hole in a large sheet of white card- 
board. The eyes were covered alternately and the child asked 
whether or not he could see the disk. In addition, we used uni- 
lateral eye-closure and eye-opening.? Some children give abso- 


?Rider, W. ‘‘ Unilateral winking as a test of comparative visual acuity.’’ 
Transactions of Amer. Opthal. Society, Vol. 8, 1897-99). 
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lutely ambiguous results in the latter procedure; for example, they 
close the left eye easily and the right eye with difficulty or not at 
all, or they open the right easily, but not the left. If these 
results are in harmony with the sighting test, there is little chance 
of error in determining the fixating eye. 

Below I give a tabular summary of the determination of the 
sighting eye in the case of sixty preschool children, supplementing 
my records by fifteen placed at my disposal by Mrs. Woolley and 
Miss Stutsman of the Merrill-Palmer School. 


Number of Number of 
Boys Girls 
RAO N GH-OV OCs risers state otentisie misre lone tere tonstekerene rte 14 21 
Webt-evedinc sce darutoraselslavcmlekeu tetera 10 (2 left- 10 
handed) 
Inconsistent. ters ciomleinee wee ree 1 4. 
11 US) 28 IGS ROE or i AU cee AB go 7. 25 30 


The outstanding result here is the large number of left-eyed 
boys. 


Factors CoNnpDITIONING FAILURE IN Binet Test VI-1 


The main purpose of this report is to bring the observations 
concerning the handedness type and the sighting eye of the pre- 
school child into connection with results for Test 1 in the Stanford 
Binet six-year level—the test of the child’s ability to discriminate 
right from left by showing the examiner his right hand, left ear, 
and right eye. Three correct responses are necessary if the child 
is to score on the test, or five of six responses if the test is expanded 
to include showing left hand, right ear, and left eye. 

Accordingly, the Stanford Revision of the Binet Scale was given 
to 54 children ranging in chronological age from 2-9 to 5-10. Only 
two of the children who tested over the mental age of six were un- 
able to pass this test. Both of these children were left-eyed children 
(one possibly fixated with either eye). This result is of interest in 
connection with the fact that right-handed adults who are left- 
eyed give evidence of a greater amount of right-left confusion than. 
do those who are right-handed and right-eyed (or left-handed and 
left-eyed). 


DEXTRALITY TYPES 157 


Fourteen children, ranging in mental age from 4-10 to 5-10, 
passed Test VI-1; eleven children, falling within the same range of 
mental ages, failed to pass VI-1. It is interesting to compare the 
handedness and eyedness of these two groups. 


Formula Passing VI-1 Failing VI-1 
Levi gel acateho(hglMNhA Ops Se ton oe iNe coal Ses oe ae 4 I* 
Ri igpand sR eer er teint eicicr sion seek 3 3 
Ran du LR ere aio aie 5 6 
Minete blew ee sree an dae telat Cantey 2 1 
otal: cemmencpireemateca toads tet ieversfvers 14 11 


*A speech case. Possibly mild unimanual preference. 


Passing VI-1 Failing VI-1 
Hisht=-evedsverrmnceeses: oslo ceelter 8 3 
HLeEt OVO reetae ar cite eee ce tne eee 2 (1 left- 3 
handed) 
Rgi-CVCU See iata eee etnias oreo cals Sete iiaeten 2 5 
INot-determined 7s estes «chee Saionee nes 2 0 
14 11 


The groups are much too small to permit the drawing of any 
conclusion, but so far as they’ go, they suggest that degree of uni- 
dextrality influences the results in this Binet test. 

A determination, for large groups of children, of dextrality 
type and of eyedness, in connection with success or failure in hand- 
ling VI-1, would possibly give considerable information concerning 
the conditioning of success or failure in this test by special traits 
and would throw light upon the possible existence of at least one 
special ability or disability. It must, however, be borne in mind 
that an intelligent child learns to handle the right-left distinction 
by indirect means and, if unable to master it in terms of the greater 
kinesthetic readiness of one hand or the other, has recourse to visual 
or verbal tags. Furthermore, I believe that dextrality types are 
bound up with certain differences in intelligence and also with sex 
differences. For these reasons the problem of the significance of 
dextrality types is a very complicated one, but one which may be 
recommended to directors of nursery schools as deserving their 
fullest consideration. 
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THe Errect oF CoAcHING 


Besides tabulating the dextrality types of the children between 
the mental ages of 4-10 and 5-10 who succeeded or failed in passing 
VI-1, we coached nine of those who failed in the test in order to 
discover whether or not it would be possible to teach them to tell 
right from left. 

Four of the children (mental ages 4-4; 5-0; 5-4; 5-10) learned 
the distinction so that they were able to pass VI-1 without an error 
from six to ten different times (Nov. 17 to Dee. 10). The test was 
always given with six items. 

The results with the other five children were not so uniform. 
There were days when they passed the test, and other days when 
they failed. Three of the children were just on the borderline of 
complete success, that is, they never missed more than two items 
out of six. One of the two remaining children was an unstable 
child, LLL or LLR in type, and a speech case. It did not seem 
possible to teach him the distinction. The youngest child (C.A. 
2-9; M.A. 3-3) learned the distinction, then lost it during a period 
of illness, and had to relearn it. He passed the test easily at a 
mental age of 4-8. His verbal cues are interesting. He has been 
heard referring to his left leg as his ‘‘wrong’’ leg and to his right 
hand as his ‘‘easy’’ hand. 

The results from coaching have again only suggestive value and 
need to be repeated on large groups of children. In general, the 
right-handed children who were right-eyed had the advantage. 


CoNCLUSION 


In conclusion, I may say that I believe extensive testing will 
demonstrate that the age at which VI-1 may be passed is in part 
dependent upon the dextrality type and the harmony or lack of 
harmony between hand and eye, and that the preferred handedness 
type and the sighting eye are, usually, native traits. 


PREFATORY NOTE TO CHAPTER XII 


This chapter is the third in the series dealing with the factors 
that lie behind the appearance of special mental traits. The par- 
ticular trait considered by May and Hartshorne, the tendency 
toward deceptive behavior, is of interest because it is one of the 
few traits of a non-intellectual sort that have been studied by 
experimental and statistical methods. 

The material in Chapter XII represents only a portion of a 
larger investigation—the portion that throws some light upon the 
relative influence of nature and nurture in causing children to 
exhibit deceptive behavior. The chief issues are: do siblings re- 
semble one another more in dishonesty than unrelated children 
resemble one another; if so, is the resemblance due to living in the 
same home, imitating the same adults, being trained by the same 
parents, etc., or is it also necessary, in order to explain the re- 
semblance discovered, to assume that some portion of it must be 
due to inherited factors that underlie conduct? 

Comparing siblings reared in their own homes with orphanage 
siblings by means of a ‘battery’ of dishonesty tests, the authors 
seek to separate different environmental factors. The results show 
that siblings do exhibit resemblance in deceptive behavior, and that 
this cannot be adequately explained by the social status of the 
homes in which they are reared. After cancelling the environ- 
mental factors, there seems to be left, in the case of deception, a 
degree of sibling similarity as great as that discovered by the best 
intelligence tests; that is, it seems as likely that deception is bio- 
logically conditioned as it does that intelligence is biologically 
conditioned.—E ditor. 
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CHAPTER XII 
SIBLING RESEMBLANCE IN DECEPTION? 


Mark A. MAy 
Yale University, New Haven, Connecticut 
AND 
HucH HARTSHORNE 
Institute of Educational Research, Teachers College, 
Columbia University, New York City 


In the course of an extensive study of deceptive behavior, the 
writers have been attempting to uncover possible causes of this 
widespread social maladjustment, and to determine their relative 
importance. Among such potent factors is, of course, parental in- 
fluence, as mediated by the conduct of the father or mother, by 
their treatment of the children, by the children’s treatment of one 
another, and possibly by biological heredity. The amount of these 
combined factors is roughly indicated by the correlation of siblings, 
inasmuch as individuals paired at random from the community 
would correlate zero, and family background and experience is all 
that siblings, by and large, have in common. But such a coefficient 
does not in itself discriminate between social and biological fac- 
tors, when it refers to siblings living in their own homes, since both 
social and biological factors are at work on each pair, or each 
group, of sibs in ways peculiar to each family. Completely to dis- 
entangle these two lines of influence would require the measurement 
of brothers and sisters who had been absented from the influence 
of their own homes from earliest infancy. Indeed, the removal of 
one member of each pair from family influence soon after birth 
would be sufficient. Supplementary data could be secured from a 
study of foster children brought up in the same home and of 
‘‘identical’’ twins (of like sex) as compared with non-identical 
twins and ordinary siblings. 

The data reported in this article are not so comprehensive as 
this, and do not afford conclusive evidence regarding the relative 


1The study here reported is part of an investigation which is being car- 
ried on by the Character Education Inquiry, Teachers College, Columbia Uni- 
versity, in codperation with the Institute of Social and Religious Research, 
New York, and will appear in more complete form in its final report. 
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significance of heredity and environment. But enough light on the 
problem is offered by the resemblances actually found to justify 
us in reporting them. 


THe NATURE OF THE DATA 


The deceptiveness, or dishonesty, of several thousand school 
children has been measured objectively, and for the most part in 
quantitative terms, by placing them in real life situations and re- 
cording their responses. The situations include (a) classroom tests 
in which the child deceives concerning his score, (b) athletic con- 
tests in which he deceives concerning his ability to score in an 
‘event,’ (c) party games in which he deceives concerning his 
achievement, (d) a home test in which he deceives concerning work 
done. Each of these general types of situation has one or more 
corresponding techniques. 

For convenience of administration, the test situations are com- 
bined into batteries according to the techniques employed. There 
are four such batteries for measuring deception in classroom situa- 
tions, one for athletic contests, one for party games, and one for 
home work. This study is confined to classroom and home dis- 
honesty, since the data 1 for siblings on other situations are at present 
inadequate. 

The techniques used may be classified and described in terms 
of the four respective behaviors measured, as follows: 

Behavior A. In one type of classroom situation the pupils de- 
ceive by adding more scores to a speed test after time is called. 
The amount which any score is thus illegitimately increased beyond 
what it would otherwise be is taken as the measure of deceit. There 
are six of these tests, affording six opportunities to cheat. The 
total amount of dishonest increase is reckoned as the total score in 
this battery. 

Behavior C. In this type of situation the pupil deceives by 
copying from a key or answer sheet and thereby increases his score 
on some school test beyond what he could achieve without such 
assistance. Three intelligence tests are used in this battery and 
the total dishonest increase is taken as the total deception score. 


Behavior P. This battery consists of three ‘peeping’ tests with 
techniques adapted from Voelker and Cady. The pupil deceives 
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by opening his eyes, or peeping, when the directions call for doing 
the test with eyes closed. The total amount done over and above 
what could be done with eyes open is the total score.? 

Behavior H. This consists in securing help on a test taken at 
home, after being definitely instructed not to get any help from 
any source. Deceit consists in getting help. The amount is de- 
termined as in the case of Behavior C. 

All these tests were standardized by giving them to a large 
number of children under such strict supervision that cheating was 
impossible or extremely unlikely. The means of such distributions 
were used as points of reference, and their standard deviations as 
units of measurement, for the cheating tests. Thus, all cheating 
scores are in terms of the standard deviation of honest perform- 
ance. All score figures in Table I are in these terms and hence 
comparable for all tests. 


SouURCES OF THE DATA 


The populations from which the sibling data in this chapter are 
drawn are as follows: 

Population I. A suburban school system near New York con- 
sisting of five elementary schools and one secondary school in which 
all children in Grades IV to VIII were tested with Behavior A 
technique; and all in Grades V to VIII with Behavior C and Be- 
havior H techniques. In this town both members of each pair of 
siblings either attended the same elementary school, or, more fre- 
quently, one of the pair attended an elementary school and the 
other the secondary school. This community offered a wide range 
of home environments, ranging from homes of comfort and affluence 
to those of abject poverty. 

Population II. A much smaller suburb of another city in which 
there were two schools. All children in both schools in Grades IV 
to VII were tested with Behaviors A and P techniques. In every 
case both members of each pair of siblings attended the same 
school. The range of home environments here was not so wide, 
lacking the lower extreme. 


2 The total amount done is used in this study, as the amount done honestly 
was too small to affect the correlations, 
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Population III. Four public schools in a congested city district 
were tested with Behaviors A, P, and H in Grades IV to VIII, ex- 
cept that Behavior H was not given to fourth-grade children. 
These children were racially homogeneous, with home environment 
rather uniformly low in the economic and social scales. Both mem- 
bers of each pair of siblings attended the same school. 


Population IV. In a mid-western town of 200,000 inhabitants 
Grades V and VI of two representative elementary schools and 
Grades VII, VIII and IX of a junior high school were tested with 
Behaviors C and H. These children came from a rather wide range 
of homes. 


Population V. This isa junior high school in a large suburb. 
All pupils in Grades VII, VIII and IX were tested with Behavior 
P. The home backgrounds of these children were distinctly above 
the average. 


Populations VI and VII, Two groups of orphans were tested. 
Both were in orphan homes and both represented the same type of 
environment as found in Population III. One institution, VI, is in 
a large city; the other, VII, is in the country. In the former we 
tested one hundred pairs of siblings with Behavior C; in the latter 
we tested forty-five pairs with Behaviors A and P. 


RESULTS 


Table I conveniently assembles the pertinent data and statistical 
results. Column 1 gives the population, referring by Roman nu- 
meral to the groups already deseribed. Column 2 is the number of 
pairs of siblings in each case. Column 3 gives the correlations be- 
tween pairs.* Column 4 gives the standard deviations of the scores 
of all the cases. Column 5 gives the standard deviations of the 
differences between the paired scores. Column 6 gives what the 


®In computing these correlations all combinations were inserted where 
there were more than two siblings in a family. Also each pair was entered 
twice; each individual thus appears on both the X and Y axis of the scatter- 
gram, making the plots symmetrical, with N equal to the number of entries. The 


work of correlating was facilitated by the use of the formula: r=1— _*% 
which is derived from the Pearson product-moment formula, og, (Column.5) 


being the standard deviation of the differences and o (Column 4) the standard 
deviation of the entries. 
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sigmas of the scores would be if the pairs were selected at random 
and the resulting correlations were zero. Column 7 is Column 6 
minus Column 5. Column 8 is Column 7 divided by Column 6. 


Tasie I.—SratisticaL Resuuts or StBiine Strupy* 


(1) (2) (3) (4) (5) (6) (7) (8) 

Population|N (Prs.) r o od Odo 6-5 Pet. of 

7 to 6 
Behavior A I 76 -220 + .05 3.27 4.09 4.62 .53 11 
Speed II 70 .292 + .05 1.59 1.89 2.25 . 36 16 
III 224 .208 + .03 3.42 4,29 4.83 . 54 11 
I, II & Ill} 370 p20) =) 02 3.00 3.72 4.24 52 12 
Vil 43 .3800 + .07 2.65 3.12 3.75 63 16 
Behavior C I 108 445 + .04 1.46 1.54 2.02 .53 26 
Copying IV 138 .483 + 203 1.50 1.59 2.12 .53 25 
I and IV 246 .440 + .03 1.38 1.46 1.96 -50 25 
VI 94 .333 + .04 1.82 2.09 2.57 48 18 
+% 38 Ooo st 00 1.73 2.00 2.45 45 18 
-\ 46 .333 + .06 2.03 2.34 2.86 52 18 
Behavior P Ill 239 022 + .03 2.62 3.04 3.70 . 66 17 
Peeping II 74 -271 + .05 3.02 3.65 4.27 62 14 
x Vv 89 oat “ae al0)s) 2.80 3.39 3.96 57 14 
SEE, Vi 402 -400 + .02 2.86 3.15 4.04 89 22 
VII 43 ne OO) st) 107 2.73 3.22 3.86 -63 16 
Behavior H I 104 -641 + .03 2.47 2.07 3.49 1.42 40 
Help IV 88 472 + .04 2.25 2.30 3.18 -88 27 
1 Ih 94 -695 + .03 2.13 1.69 3.01 1.32 43 
Iilz 55 .832 + .02 2.73 1.58 3.86 2.28 59 
Le a, 345 sh OOo sees OE 2.62 2.01 3.70 1.70 46 

*Note the following relations between the columns of Table I: 
Column 3: ey I 8 Column 5: og=oV2(1—r) 
oe Column 6: ga =oV2- 
od Col. 7 ~— 
Column 4; ¢ = —————_ Column 8: =1—yl—r 
v2(1—r) Col. 6 


INTERPRETATION 


Before indicating the bearing of the facts of the table on various 
hypothetical explanations of sibling resemblance, certain features 
should be noted. The important comparisons are between the dif- 


“Column 8 shows the percentage of restriction in range of difference be- 
tween siblings compared to range of differences as found in uncorrelated pairs. 
The greater the figure in Column 8, the greater the restriction of range in 
differences or the greater the resemblance of the pairs. The figures of Column 
5 depend on both the correlation between pairs and the sigma of the scores. 
These sigmas are in the same units in each section of the table and are com- 
yarable among the tables. We may thus compare the resemblances shown by 
Columns 7 and 8 with the resemblances indicated by the correlations alone. 
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ferent behaviors on the one hand and the different populations on 
the other. First, as to behaviors, Behavior H, which is the test 
taken at home, elicits the greatest resemblance between siblings, 
doubtless partly because of collusion. Behavior C comes next, per- 
haps because the copying type of deception is more consciously 
dealt with by parents. Behaviors A and P come last, but still show 
genuine resemblance. 

Second, as to populations, the orphans, VI and VII, were not 
given Behavior H, but of the other populations, the congested 
population, III, and III,, shows the greatest resemblance. In 
III, and III, two schools are combined, a better and a worse in 
each case. The average r for all of III is .763. In the other be- 
haviors little difference among the normal populations is found, 
but in the case of Behavior C, there is considerable difference be- 
tween the normal and the orphan groups. 


POSSIBLE EXPLANATIONS OF SIBLING RESEMBLANCE 


We have now to account for these likenesses and differences 
among the populations and point out their significance for the inter- 
pretation of the sibling resemblance. We shall discuss three possi- 
ble explanations: collusion, common environment, and heredity. 

1. Collusion: This is possible only in Behavior H, the task 
done at home. Here we may expect that some children will either 
help one another or encourage each other in the use of forbidden 
aids. The size of the sibling r’s bears out this expectation. For 
all populations it is .705 and runs as high as .83 in one group con- 
taining fifty-five pairs. This is considerably higher than the relia- 
bility of this single test and indicates that there is greater likelihood 
that two siblings will cheat in equivalent amounts on a single 
occasion than that the same child will cheat in the same way on 
two occasions. If mere general home influence were the prepotent 
influence, this would not be the case. As a matter of fact, general 
social-cultural level, as determined by an experienced case worker, 
correlates .03 with Behavior H, showing that whatever makes sib- 
lings resemble one another in deception on home work is not the 
kind of thing in which homes as such differ from one another, so 
far as we have measured them. 


= 
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Furthermore, if siblings helped each other on the home test or 
got help from a common source, one might expect the older to exert 
the greater influence. This cannot be shown from our data, but 
it is of interest that, although there is no great difference between 
the mean of the older and the mean of the younger, yet in a popula- 
tion of 119 pairs, 9 percent were identical, in 61.5 percent of the 
pairs the younger cheated more than the older, and in 29.5 percent 
of the pairs the older cheated more than the younger. The older 
and younger were nearly alike in variability. 

2. Common Environment. There are, of course, other home 
factors than collusion, direct assistance or example that make 
for honest, rather than dishonest, behavior, or vice versa (such as 
code, ideal, or general social stability), so that one might expect 
that two children reared in the same home would, by sharing certain 
common experiences, become more alike than if one were reared in 
one environment and the other in another. Of course, ‘the home’ is 
not really identical, even for its own children, but certainly the 
range of social experience offered by a large number of homes is 
far wider than that offered by one. If deception is correlated with 
these variable factors, the inevitable result will be that siblings 
will resemble one another more than the children of other families 
by an amount expressed by the coefficient of correlation between 
members of pairs. 

The populations reported in the table present about as wide 
divergence as is likely to be found in cities and suburbs. The in- 
clusion of rural groups would extend the range, but it is wide 
enough as it is to produce correlations so far as these may be due 
to social influence, and to allow for variation in the size of correla- 
tions as longer or shorter sections of the entire range are taken for 
comparison. So far as deceptive behavior is built into habit sys- 
tems and is not a mere matter of collusion or example in the home, 
it will appear in the correlation of tests taken away from home. 
We have already pointed out the existence of such correlations as 
are found in Column 38 of our table. 

Let it be clear that if the correlation between home environ- 
ment and deception on tests taken away from home is due to influ- 
ences of the home which vary with social levels, then, 


168 THE TWENTY-SEVENTH YEARBOOK 


1. The wider the range of homes reported in a given popula- 
tion, the greater will be the variability in deception; and the nar- 
rower the range of homes, the smaller the variability in deception. 

2. In like manner, the range of homes in the different popula- 
tions will show a correlation with the different sibling correlations; 
that is, the low sibling r’s will be observed in the populations with 
the narrower range of homes; and the larger sibling r’s will be 
found in the populations with wider range of homes. 


Altering the Range of Environment 


There are three available ways of expanding or contracting the 
range of environment in order to test this second hypothesis of the 
social causation of our r’s. One is to select a population which 
all judges would agree is relatively homogeneous. A second method 
is to measure home background and by applying the partial corre- 
lation technique keep this influence constant. A similar method is 
to select as a basis of study the cases falling. within certain sections 
of the home background scale. All three methods have been em- 
ployed and will be reported. 

Method 1: First it is necessary to indicate roughly the range of 
socio-economic background of Populations I to VII. 

The ranges of home background reported in the various popula- 
tions tested have been determined partly by test and partly by 
observation and common consent. Their relative ranges are shown 
in Figure I. 


Population Crude Average Cultured 
Abject Affluent 

VI-VII 

ir 

I 

VW 

IV 

I 


Fig. I.—RANGE OF HOME BACKGROUND FOR THE SEVEN POPULATIONS 


* The test used was a questionnaire standardized and validated by Dr. V. ~ 
M. Sims (and issued by the Public School Publishing Co.), as a measure of 
socio-economic level of homes. It is administered in school. Sims reports its 
reliability as .94. 
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The range of background for the two orphan groups is really 
much narrower than indicated in the chart, for no matter what the 
range of background from which they came, they had for some 
time been living in one environment, as though all were children 
in one big family. 

In width of social range the populations rank roughly: VI-VII, 
III, Il, V, IV, I. To the extent that the social factors thus dis- 
tributed are determinative of deception, we should expect a corre- 
sponding variability in deception range, or deception sigmas, and 
in deception r’s. The table shows that this expectation is not 
realized. For convenience the sigma and correlation ranks are 
gathered in Table II, the columns of which are as follows: Column 
1 gives the rank order (slightly modified) of the various populations 
in spread of home background as in Fig. I. Columns 2, 4, 6, and 8 
give the (modified) rank order of these same populations in the size 
of sigmas of their deception scores for Behaviors A, C, P, and H, 
respectively. Columns 3, 5, 7, and 9 give the rank-order in size 
of r’s between siblings for the same four behaviors. 


TaBie I].—Rank OrperR or PopuLaTions IN SociAL VARIABILITY, DECEPTION 
VARIABILITY, AND SIBLING CORRELATION 


Rangc oA Ta Oc To op Tp oH Ty 
AG UL re Pate SARS a il 3 Dre) eae Seca: | aetna 1 PE eer cae teers, 
Wil Sete set LI estes a\iis, oe te 2 Li erate | eer et etal seaatiede ce| are 
Piece oes 2 6 DF eee | tories. 1 3 2 5 
LEP reeks ates 3 1 Die Sikap eh ava eee 3 Dee yeenene | Me hegees 
Wid osortaeus hs 7 Sees cet] lee hepa ie ean CRI 2 PS sete | sccgoretens 
Vets Eee (Sh al ieasig Seal ee rea 1 eA ateretetevalteraesltede i 1 
Ne So? ce ere 8 5 1 1 Sido Molede ete ieniet aces 2 3 


From Table II it can be seen that there is no correspondence 


between deception range and social background range, except in 
the case of Behaviors P and H, which show a slight tendency toward 
equivalence in range. We infer, therefore, that there are factors 
other than general socio-economic level at work to promote de- 
ceptive habits. 

The same conclusion is reached from the relations among the r’s, 
for in only two cases do the r’s increase with range of background, 
viz., Behavior C and Behavior H. In the latter case Population 
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III is an exception to this tendency, showing the highest r between 
siblings in a narrow range of home background, though here the 
difference may be accounted for by collusion. 

Even the orphan population shows a greater variability than 
Population II and a larger sibling r than Populations III or I in 
Behavior A, a greater variability than Populations IV or I in Be- 
havior C, and a larger sibling r than Populations V or IV in the 
ease of Behavior P. Also in the orphan population, VI, the children 
who had been living in the institution more than half their lives 
(‘‘+14”’ of Table I) show the same sibling r as those who had been 
in the institution less than half their lives (‘‘—14”’ of Table I), 
and only slightly less variability. If gross variation in environment 
is producing the sibling correlation, one would expect the r to be 
lower in the case of those who had been longer in the institution. 

Method 2: The second method of testing the environmental 
hypothesis involves securing some measure of the variability of 
home background with a view to determining its correlation with 
deception. If we can arrive at even an approximation of the corre- 
lation between variation in homes and deception, we can then use 
the partial correlation technique and hold the home variation con- 
stant in the sibling r’s. 

We have made use of three ways of measuring the socio-economic 
level of homes. As a first crude approximation we used the occu- 
pation of parents. We ranked the occupations in three groups— 
professions, trades and erafts, and unskilled labor. The relation 
between this grouping and the deception (Behavior C) of the 
children is about .30 (coefficient of contingency). This determina- 
tion was made on the two populations, I and IV, where the range 
of homes is widest. 

In two populations, II and III above, we measured the socio- 
economic levels of the pupils with the Sims score card. In one of 
the lower ranges of homes, Population III, it yields a correlation 
with cheating of only .115. In the upper range the r is about .20. 
If all ranges are combined so that we have a kind of unselected 
sample of homes, the correlation between Sims score and cheating 
rises to .30, which is about the same as we got with the crude 
occupational measure. 

In another study we employed a trained social worker to visit 
the homes of the most deceptive and the least deceptive. She made 
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a very careful study of over 150 homes to find, if possible, the 
details of experience and environment that were influencing the 
conduct of the children. These homes were grouped in fifteen 
levels according to their quality as homes. The correlation between 
these homes, thus rated, and the deception of the children is .30,° 
again the same as found by the other two procedures. But the 
regressions here are not linear. The correlation ratio of homes to 
cheating shows an eta of .454, and that of cheating to homes, one of 
440. In the case of the Sims scores the regressions were linear. 
We shall return to this point later. 

These figures indicate that the correlation between range of 
homes as measured in these three ways and cheating in school is 
around .30. We can now find the partial r’s between siblings in 
deception with home background (as far as we have measured it) 
constant. But since the above r of .80 represents a wide range, we 
should use only r’s from Table I which are also on wide ranges. 
The resulting partials are given in Table ITI. 


Tasrie III.—Sisring ResemMBLANCE witH Home BackGRouND CONSTANT 


Behavior Poprlation Observed r Partial r* 
Ae ele specec ate isislelatal ead te eer cate cies toss li QU ia aE 225 1438 
ORs Ee Sea SRS Baca ae tata pl AY 440 385 
NRE e, ee tue eae TST as ebokats oe) archaic eS II, III, V 400 341 
AEC Se Daca eee ite toate Sates 470 394 


*Based on the assumption that the r between siblings in home background is 1.00. See foot- 
note No. 7, 


The r’s indicate that children who come from homes of the same 
general social level still show correlations between siblings. 

Method 3: As a check on this we took from Populations IT and 
III all those who deviate more than about two sigma from the 
Sims socio-economic means. We found the r between the siblings 
for classroom deception beyond this extreme in the upper range 
to be .35 and beyond this extreme in the lower range .30. We then 
took a ‘slice’ out of the middle, including all those between about 


*We recognize that a correlation obtained from the measures of the ex- 
tremes in one variable is somewhat spurious. But our aim here is to arrive 
at only an approximation of the correlation between variation in homes and 


deception. 
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+.3 sigma and —.3 sigma and still got a correlation for classroom 
deception of about .385. These determinations check and verify the 
partials. 


Further Criticism of the Common Environment Hypothesis 


Our study of the influence of a common environment on sibling 
resemblance has so far led us to the conclusion that, while such 
factors as we have been able to measure or observe are doubtless 
significant, they do not account for all the resemblance found—that 
there is left about as much likeness as is found in the ease of sibling 
resemblance in intelligence (as will be shown presently). Before 
turning to the consideration of the other proposed explanation, 
there is one more technique for the evaluation of our data which 
will be suggestive in showing how inadequate the observed relations 
between deception and home background are to account for sibling 
resemblance. 

In the first place, it can be shown that the correlation between 
home background and deception must equal the square root of the 
correlation between siblings before it can completely account for it. 
The relation is rsc = Yen, IN which r,, is the correlation between 
sibs in cheating and r,, the correlation between home background 
and cheating.’ 

We have estimated that the correlation between siblings for 
Behaviors A, C, and P, combined, is not less than .47. If this is 
to be wholly accounted for by virtue of the type of social influence 
measured, then the correlation between home background and cheat- 


‘This simple relation is derived as follows: 
*dse 

Let rz, =1— a 

aenase 207e 


the difference between siblings in cheating. 
2, 


Let ren = 1— —ag_» where o’ac is the standard deviation, squared, of the 


, where o*dse is the standard deviation, squared, of 


arrays of cheating for any home level, and o’c the standard deviation, squared, 
of cheating. 

It can be shown that o?dsc = 2o’ac. 

Then, by substitution, rz. = 17, 

This can be checked by the formula for partial correlation in the case of 
Tse — he ch 
PCE | 
formula for partials and is applicable on the assumption that the correlation 
between siblings in environment is 1.00. 

Now, when rg¢ = Ten, then Tgc.n — 0. 


siblings, which is Trge.n = This formula is a special case of the usual 
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ing should be \/.47, or .685. Our measures and our estimates by 
an expert afford a correlation of only .30 between deception and 
home background, which is enough to account for an r of only 
about .10 between siblings. In other words, there is considerable 
resemblance still to be explained. 

The second application of this technique leads up to our final 
suggestion. 

As has been noted, our measures of home background are not 
perfect. It may well be that, with more refined methods for reveal- 
ing other factors than those included in the three methods de- 
scribed, the correlation between deception and home background 
might be increased to a point sufficient to account for the correla- 
tions between siblings. Suppose for a moment that this has been 
done. Then in the case of Behavior P, where r,, = .40, the corre- 
lation between cheating and home background would have to equal 
\/.40 or .63 to account fully for the .40. Let us suppose that this 
is the case, and that the r of .63 is based on an unselected popula- 
tion covering a wide range of environmental levels. Then, if we 
reduce the variability of the home background (as measured by 
our hypothetically adequate test) by one half, we shall automatic- 
ally reduce the r of .63 to .45, and the sibling r from .63?, or .40, to 
45? or .21.8 This .21 is the predicted r between siblings from a 
population when the standard deviation of the environment variable 
is reduced one half and the r between cheating and the full range 
of the environment is .63. 

Now, as a matter of fact, taking our rough range-of-background 
chart as our measure of the relative range of different groups, 
Population III shows about half the variation of the total range, 
and Table I shows that the correlation between siblings for Be- 
havior P in Population III is .82, which is only .11 higher than the 
predicted .21. Also, Populations IJ and V, likewise greatly re- 
stricted in scope, show r’s in each case of only .27. These three r’s 
have P.E.’s of .03 for III and .05 for II and V, so that the predicted 
r of .21 is within 4 P.E. of one of the observed r’s and within 2 P.E. 
of the other two. 

The meaning of this is that, in the case of Behavior P (peeping), 
whatever resemblance there is between siblings may well be ac- 


* For formula, see Kelley, T. L., Statistical Methods, p. 225, Formula 187. 


<r 
s 
= 
ad 
TT 
wc ST A 


ne a, Ee, ee Tee Gee a ee 
=> PEs ‘t= 


— ie 


cine ee oe os ae a ee ee ae 


176 THE TWENTY-SEVENTH YEARBOOK 


read or to translate Latin? The relatively innate factors® would 
include temperament, emotional stability, power of inhibition, re- 
sponse to persons as compared to things or ideas, and general in- 
telligence. 

It is commonly supposed that the correlation between the I.Q.’s 
of siblings in a homogeneous age population of unrestricted range 
is around .50. Miss Hildreth? summarizes work done up to 1925, 
showing that the actual results on over two thousand pairs run 
from .30 to .63. These yield an average of .48. Gordon’s data on 
219 pairs of orphans, when recalculated by Elderton by entering 
each pair twice in the seattergram, give a correlation of .467. Miss 
Hildreth reports .322 as the obtained r between 253 pairs of 
orphans. These two average .3888. 

Our deception correlations, when corrected for attenuation, 
show comparable results. Extracting from Table I the r’s obtained 
on populations of widest range and correcting these for attenua- 
tion, we get Table IV. 


TaBie [V.—CoEFFICIENTS OF CORRELATION BETWEEN SIBLINGS IN DEcuEPTIon, 
CoRRECTED FOR ATTENUATION 


r Corrected for 


Population Behavior Raw r Mefenn ation 
Pal Tilsen ae ioe A 225 256 
Li ARE eR Gcina GS aa Oobindon pe C .440 .510 
10 PGW Ve Set smcin cil ricer 1 .400 .470 
Mbheoreticalaaeemeemnerie sak A+C+P .470 .495 
OrphanspVillaeres asc aeee A .300 .341 
Orphanszt Viewers. pretence Cc .333 .387 
Orphans pVilleecrr ws vers eee iP .300 .358 
‘DHEOTELICAIn. ane eee A+P+C sei) .390 


Thus, the resemblance between school siblings in deception is 
about the same as it is in I.Q. Furthermore, it shows the same 
amount of independence of home background range as is shown 
by resemblance in intelligence. These facts may mean that the 
tendency to deceive is conditioned by nature and nurture in about 
the same proportion as is general intelligence. 

*It is of some importance that liars have been found to be ‘‘more sug- 
gestible, more imaginative, less steady in motor control involving an emotional 
disturbanee.’’ See the study in untruthfulness by Slaght, University of Iowa 
Studies in Character Education No. 4, referred to by Starbuck, E. D., Re- 
ligious Education, January, 1927. 


10 Hildreth, G. H., The Rescmblance of Siblings im Intelligence and 
Achievement. Teachers College, 1925. 
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SUMMARY 


Seven hundred thirty-four pairs of siblings scattered among 
seven school populations were measured with four different types 
of deception tests. The general socio-economic level of the homes 
was measured in two populations by a scale made for this purpose 
and by the careful estimates of a trained school visitor in one 
population. Two populations were orphans from one of the com- 
munities already measured. The range for the other two popula- 
tions was estimated. 

These siblings were found to resemble one another in deception, 
whether the opportunity to cheat was offered at home or at school. 
The resemblance in deception on the test taken at home was shown 
to be due chiefly to collusion. Two theories were proposed to ac- 
count for the resemblance between siblings in deception on tests 
taken in school when collusion was impossible, viz., (1) the influ- 
ence of environment, through which siblings develop the same 
standards and habits of school behavior, and (2) the presence of 
a common hereditary factor or group of factors in children of the 
Same parents showing itself in similar deceptive behavior. These 
two theories are not mutually contradictory. 

The data assembled show clearly that the coarser differences 
among homes in socio-economic status, although associated some- 
what with differences in deception, do not fully account for like- 
nesses between siblings. So far as the home training is responsible 
for the resemblances observed, it must be due to subtle factors not 
incorporated in the home ratings of a trained case worker. Fur- 
thermore, since orphan siblings show resemblances which cannot 
be attributed to present home background, what home influenee, if 
any, is at work must be exercised in early childhood. 

A biological factor is thus suggested as a possibility which is 
made the more plausible in view of the fact that the resemblance 
of siblings in deception is about the same as that between siblings 
in intelligence, and shows similar stability amid home backgrounds 
of varying range as is shown by resemblance in intelligence. What- 
ever arguments are adduced for or against the biological causation 
of sibling resemblance in intelligence are applicable also to the 
interpretation of sibling resemblance in deception. 


PREFATORY NOTE TO CHAPTER XIII 


One may not at first see the relation of this chapter to the 
nature-nurture problem, but the relation will be more evident if 
the problem is conceived as that of the possibilities and limitations 
of training. If training of a particular mental function has no 
influence on efficiency in other mental functions, it is obvious that 
the total effect of the educative processes, in school and out, will 
be less than if transfer takes place. It is not, to be sure, a ques- 
tion of the improvement or non-improvement of original endow- 
ment, but of the extent to which original endowment in a particu- 
lar line is realized by training in other lines. Life is too short 
and the kinds of mental performances are too numerous to enable 
anyone to develop every capacity to its limit by specific training; 
so it is important to know whether and to what extent general effi- 
ciency is influenced by the development of special efficiency. 

The conclusion of the author that some degree of transfer does 
take place, and that ‘‘some of the most important agencies of 
transfer are to be found among the higher-level relations, in gen- 
eralized attitudes, words, ideals, sets, and ways of going about 
mental operations,’’ seems to be justified by the experimental litera- 
ture on the problem. If this is true, it is possible that formal 
training may have an effect upon the I.Q. We would certainly 
be justified in so concluding unless it could be shown that the ordi- 
nary experiences of everyday life and of schooling of any reason- 
able kind themselves give the total amount of general training 
which it is possible to secure. Whether the more formal kinds of 
specific training are necessary to accomplish this result must still 
be held an open question. 

The summary has been organized in six sections. Readers who 
are primarily interested in the final conclusions will save time by 
turning directly to the sixth section, or better, to the fourth, fifth, 
and sixth sections. Readers who question the validity of the sum- 
maries and conclusions therein presented or who are unfamiliar 
with the history of this aspect of educational psychology will per- 
haps find it worth their while to peruse the entire chapter. 

—L.M.T. 
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CHAPTER XIII 
THE TRANSFER OF TRAINING 


Guy M. WHIPPLE 
Danvers, Massachusetts 


It is the purpose of this chapter to set forth the facts concern- 
ing that aspect of the general problem of the possibilities and limi- 
tations of training which is characterized by the stock phrase, ‘‘the 
problem of formal discipline,’’ or ‘‘the problem of the transfer of 
training.’’ It will be desirable to consider (I) what the problem 
is, (II) the methods and materials used in its investigation, (III) 
the results of typical investigations, (IV) what has been proved 
concerning the amount of transfer, and (V) what has been proved 
concerning the agencies of transfer, from all of which may be 
drawn (VI) certain inferences concerning the pedagogical signifi- 
cance of transfer. 


I. THe PrRosplemM or TRANSFER 


Suppose a stimulus, r, impinged upon some sense organ and, 
after passing through some central portions of the nervous system, 
some associative neural connections, a, finally, by means of some 
motor neurone, set into action some motor response, m. This 
schema may be thought of as picturing the theoretically simplest 
sort of sensory-motor response. Suppose that this sequence of 
activity—r, a, m,—at first operated haltingly or feebly or inaccu- 
rately, but that its operation resulted in no arousal of pain, rather, 
on the contrary, produced some sort of satisfaction to its possessor. 
If, now, r is repeated, there is a definite tendency toward the 
arousal of the same sequence of activities, and if r is repeated often 
enough or intensely enough or in sufficiently close succession, we 
may expect the sequence eventually to operate unhesitatingly, 
strongly, and accurately. We then say that this response (m fol- 
lowing upon 7) has been improved by practice and eventually, if 
it continues to operate in this improved manner, that it has been 
‘learned.’ This process is the process of training, reduced to the 
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simplest terms. Thus, for example, the infant ‘learns’ after a num- 
ber of trials to suckle his mother’s breast successfully or the school- 
boy learns to answer ‘‘six’’ to the question ‘‘How much is four 
plus two?’’ 

Now, it is possible to conceive of the entire operation of the 
nervous system (or of our mental activities, if we prefer to talk in 
psychological terms) as representing an enormously complex aggre- 
gation of hundreds of thousands of such ‘stimulus-association- 
response’ acts. The suckling response might have been Number 
34 that the infant encountered and perfected; the answering ‘‘six’’ 
to ‘‘four plus two’’ might have been Number 635,782 of the series 
making up his life. It is also possible, as a matter of psychological 
theory, to assume that each one of these hundreds of thousands of 
responses, or learned acts, is an independent unit, that it operates 
in and by itself, that it has to be experienced and perfected ab 
initio, without any reference to the other hundreds of thousands 
of responses. On this assumption, the particularity, or the speci- 
ficity, of the individual response is complete; its relationship to 
other responses is zero psychologically (whatever it might be 
logically). On this assumption, the problem of transfer disappears, 
since it becomes obvious that there can be no transfer; the im- 
provement of any single response will never bring about the slight- 
est improvement in any other response. 

It is possible, once more as a matter of psychological theory, 
to assume that any one of the hundreds of thousands of responses, 
or learned acts, may be to some extent neurologically (or psycho- 
logically) ‘hooked up with,’ or interrelated with, numerous other 
responses. On this assumption, no given response can be positively 
guaranteed to be purely particular in its operations, because the 
chance that any given response is so completely dissimilar in its 
nature and operation from every other one of hundreds of thou- 
sands of responses as to be thus totally unrelated to any one of 
them is obviously exceedingly small. This is only another way of 
saying that responses actually are complex and mutually over- 
lapping. It is only another way of saying that the simple schema 
—r, a, m—with which we started is inadequate to the real situation. 
Thus, it requires no wild stretch of the imagination to suppose that 
if Response No. 635,782, previously mentioned, had been perfected, 
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and if Stimulus No. 635,783 were ‘‘How much is two plus four?”’ 
the appropriate responsive act—saying ‘‘six’’—might be forth- 
coming with no previous training at all. Or, again—drawing 
heavily, perhaps, on our imagination this time—supposing that in 
perfecting Response 34 (learning to suckle effectively) the infant 
in some vague organic way discovered that it ‘‘paid to keep at it’’; 
then, is it not conceivable that at the age of seven the same 
youngster might on that account exhibit a very creditable degree 
of persistence in learning his sums? Let us suppose, anyway, for 
the sake of argument, that that is what happened. If so, we must 
rearrange our original schema, at least so far as to add some fac- 
tor, X, which stands for: ‘‘ When confronted by an obstacle, keep 


34 635,782 


at it.’’ Now, all we have to assume is that learning the sum of 
four and two presents a slight obstacle, which immediately brings 
into the learning process the by-now-well-correlated ‘‘keep at it,’’ 
with the result that the new acquisition is accomplished, though 
conditions are such that otherwise it would not be accomplished. 
Here, then, we do have real transfer of training. The factor, 
X, common to the two seemingly utterly diverse situations (34 and 
635,782), which was developed or improved when 34 was operating, 
is now a potent factor in making 635,782 operate. This is the sort 
of thing that takes place, according to many writers who champion 
the ‘common element’ theory of transfer. 

Between these two theoretical extremes, the complete specificity 
of psychophysical activities on the one hand and the complete inter- 
locking of them on the other, we may, of course, assume the opera- 
tion of any assignable intermediate degree of interrelationship. 
And since no one seriously champions either of the extreme 
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hypotheses, the actual problem of the transfer of training becomes 
largely a problem of degree—and, of course, of adequate descrip- 
tion of the underlying conditioning factors. That is, how much 
transfer is there? Under what conditions does it take place? 
Under what conditions does it not take place? In short, there are 
two chief problems confronting the experimentalist—the amount 
of transfer and the agencies of transfer. 

Both of these problems are of decided theoretical interest—it is 
difficult, for example, to see how one can develop his notion of 
the nature of mental operations without committing himself to 
some sort of formulation of their organization into certain more 
or less independent and other more or less interrelated mental 
functions. 

Both of the problems are likewise of decided practical interest, 
as, for instance, in laying out a satisfactory program of mental 
training, in deciding what forms of pedagogical instruction are 
needed to bring about specified improvements in mental efficiency, 
and so on. These pedagogical issues are perfectly familiar to our 
readers, especially as they become formulated in terms of the vari- 
ous school subjects, e.g., ‘‘Does training In geometry improve 
reasoning?’’ ‘‘Does training in Latin assist the pupil in acquiring 
French or German?’’ ‘‘Does penmanship transfer?’’ meaning 
“Does the child continue in his after-school life to follow the tech- 
nique of writing taught him laboriously in the grades?’’ ete. 

Such, then, is the problem of the transfer of training. In the 
simplest terms it is: Does the improvement in a single response to 
stimulation entail improvement of any other response? In more 
complex terms it becomes: Does training in a complex and more 
or less related series of responses (like that involved in learning a 
given branch of subject matter) entail perceptible increase in 
capacity to learn some other series of responses (like those involved 
in learning some other branch of subject matter) ? 


II. GenrraL MEeTHops AND MATERIALS USED IN THE EXPERIMENTAL 
INVESTIGATION OF TRANSFER 


The literature of experimentation in the field of transfer is 
voluminous; to attempt a comprehensive summary of all the in- 
vestigations that have been reported would far exceed the space 
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allotted to this chapter. As a matter of fact, such an attempt is 
unnecessary because several such summaries have already been 
made. 

Our aim here is to describe certain investigations that may be 
regarded as typical of the methods and materials that have been 
used and of the interpretations that have been drawn. 

First, however, a few words about the methods, the materials, 
and the subjects used in the experimental work. 


1. Methods 


With respect to methods, it is possible to distinguish (a) the 
individual method, (b) the one-group method, (ec) the two-group 
method, and (d) the three-group method. 

(a) The individual method consists, of course, of experimenting 
done by one person who measures his ability in some specified 
activity, next trains himself for some time in some more or less 
related activity, and then measures once more his ability in the 
specified activity, with the intent to discover to what extent, if any, 
the period of training has affected his performance in the specified 
activity. The early work of William James, in his study of trans- 
fer of training in the field of memory, illustrates this method. 
Naturally, in the hands of a skilled psychologist, this method may 
yield results of significance, especially when fortified by introspec- 
tive comments, but methodologically it can scarcely be defended. 

(b) The one-group method differs from the individual method 
primarily merely in increasing the number of subjects. An ob- 
vious illustration is afforded by the early experiments reported 


1The reader will find useful summaries, for example, in the following: 
Colvin, 8S. S. The Learning Process, Chs. XIV-XVI. 
Heck, W. H. Mental Discipline and Educational Values. 
Henderson, E. N. TYeautbook in the Principles of Education, Ch. X. 
Hewins, Nellie P. Zhe Doctrine of Formal Discipline in the Light of 
Experimental Investigation, pp. 4-48. 
Rugg, H. O. The Experimental Determination of Mental Discipline in 
School Studies, Ch. I. 
Starch, D. Educational Psychology, Chs. XITI-XIV. 
Thorndike, E. L. Educational Psychology, Vol. II, Ch. XII. 
Of these, perhaps the most comprehensive and useful is that of Rugg, who, 
by means of charts, has prepared an excellent synoptic summary of the essen- 
tial features of all the studies prior to 1915. 
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by Thorndike and Woodworth in which a group of subjects prac- 
tised estimating the areas of rectangles until a ‘‘very marked im- 
provement was attained.’’ The same group then estimated areas 
of the same general size but of different shape and areas of the 
same shape but of different size. By the use of this method indi- 
vidual irregularities in the effect of training are more or less 
eliminated or compensated, but there is no way to determine accu- 
rately how much of the improvement is due to a real transfer 
effect, how much is due to improvement within the test series itself. 

(c) The two-group method has been generally employed in all 
the more recent experimentation to meet the objection just cited. 
In principle, the method is simply this: the subjects are divided, 
on some desired basis, such as age or general intelligence or pre- 
vious training, into two equivalent groups, of which one is known 
as the ‘‘Trained,”’ or the ‘‘Experimental’’ group, the other as the 
*‘Control’’ group. Both groups take both the preliminary and 
the final test series, but only the former takes the intermediate 
training series. If the division of the groups is carefully done, the 
difference between the performance of the trained group and the 
control group in the final test may be expected to indicate the 
amount of transfer effect, uncomplicated by possible effects of 
practice in the preliminary series or of maturation, incubation, 
and similar influences that might conceivably cause the final test 
results to differ from the preliminary test results. 

(d) The three-group method is an elaboration of the two-group 
method which may well be justified under certain conditions of 
experimentation. This method follows the plan of the preceding 
one, but there is added a third group (equivalent in characteristics 
of its personnel to the others) which takes none of the tests except 
the final one. If, then, unavoidable changes are present in the final 
tests, as in the weather, in the school program, in the experimenters, 
in the place, ete., so that conditions differ from those prevailing in 
the preliminary test, their effect may be studied and allowed for. 
An illustration of the use of this method may be found in C. P. 
Wang’s investigation of the general value of sense training.” 


*Wang, C. P. The General Value of Visual Sense Training in Children. 
Educational Psychology Monographs, No. 15, Baltimore, 1916. 
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2. Materials 


By ‘‘materials’’ I refer to the nature of the mental activities 
chosen as the objects of the investigation. The investigations may 
be conveniently assembled, for descriptive purposes, into four 
groups, viz., (a) those dealing with memorizing, (b) those dealing 
with various aspects of perceiving, including sensory discrimina- 
tion and apprehension, (¢) those dealing with voluntary effort and 
motor adjustments, (d) those dealing with schoolroom activities 
and attitudes. It might be possible to make a fifth group to include 
investigations dealing with so-called ,‘‘cross-education,’’ or the 
transference of practise gained in one member of the body to a 
symmetrical member, 


3. Subjects 


There has been a decided tendency to carry on the investigation 
of the transfer of training with adults as subjects. Thus, taking 
25 of the studies reported prior to 1916, we discover that only 
seven of these were conducted with school children, while of the 
remaining 18, eight used only graduate students or instructors in 
psychology and the other ten used college or normal-school stu- 
dents who were for the most part specializing in psychology. While 
this tendency to make use of adults, particularly of those adults 
immediately available for work as subjects in the psychological 
laboratory, is natural enough and has certain advantages (greater 
steadiness of application, ready grasp of instructions, possibility of 
illuminating introspection), it has likewise very definite disadvan- 
tages, as will be pointed out more fully farther on. 

Similarly, there has been a decided tendency to carry on the 
investigation of the transfer of training with a quite limited 
number of subjects. Thus, in 13 of 29 investigations or sub-sec- 
tions of investigations, conclusions of more or less import have 
been based upon the behavior of two, three, four, five or six sub- 
jects only. In 12 other investigations the number of subjects has 
ranged from 12 to 44. 

Again, if the reports of 31 investigators whose work can thus 
be classified are grouped into laboratory investigations and school- 
room investigations, we discover that more than two-thirds of them 
(21 to be exact) are laboratory investigations. 
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We may say, then, that the typical experimental study of the 
transfer of training is the laboratory investigation in which the 
subjects are a small group of students who are specializing in psy- 
chology. It has long been apparent to many educational psycholo- 
gists that conclusions reached by the laboratory study of the trained 
adult may not be too hastily generalized to apply to the mental 
processes of the growing child, and it is certainly wise to keep this 
caution in mind when dealing with such an issue as the possibilities 
and limitations of training, which, in the nature of the case, implies 
the engineering of modifications in the mind of a still immature 
organism. 


III. Some Typicau INVESTIGATIONS OF TRANSFER 
AND THEIR RESULTS 


1. Studies Dealing with Memory 


(a) James. In his Principles of Psychology (1890), Vol. I, pp. 
666-7, James reported the effect of training in memorizing a cer- 
tain kind of material upon efficiency in memorizing other kinds 
of material. He was led to conclude that there was no trans- 
fer of training, that one’s native retentiveness is unchangeable, that 
no amount of culture modifies a man’s general retentiveness. This 
work of James represents, of course, a pioneer exploration. The 
experiment was loosely organized; in particular, as has already 
been noted, there was no control group. 

(b) Peterson. The difficulty just mentioned in James’ work 
was recognized in the repetition of the experiment in 1912 by Peter- 
son,* but his data were too meager to permit reliable conclusions; 
in fact, three of the unpracticed subjects gained as much as one of 
the two practised subjects; nevertheless, we find Peterson saying 
that ‘‘there was certainly a large transfer of training here’’ (refer- 
ring to one subject). 

(ce) Ebert and Meumann. The experiments of these investi- 
gators at the University of Zurich* on from 2 to 6 laboratory sub- 


* Peterson, H. A. ‘‘Note on a retrial of Professor James’ experiment on 
memory.’’ Psych. Rev., 19: Nov., 1912, 491-492. 

‘Ebert, E., und Meumann, E. ‘‘ Ueber einige Grundfragen der Psycholo- 
gie der Uebungsphanomene im Bereich des Gedichtnisses.’’ 
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jects dealt with the effect of memorizing series of nonsense syllables 
upon efficiency in memorizing various other sorts of material. 
These experiments are of interest because the investigators claimed 
to have demonstrated the existence of a very large amount of trans- 
fer, explained not only as due to increased capacity to attend, better 
technique in learning, ete., but also as due to a hypothetical ‘‘sym- 
pathetic interaction of allied memory functions.’’ 


(d) Dearborn. The repetition of the work of Ebert and Meu- 
mann by W. F. Dearborn,® then of the University of Wisconsin, 
showed that a large portion of the transfer effect alleged by Ebert 
and Meumann was due to the gains made within the test series and 
that the remainder could be attributed to general improvement in 
orientation, attention, and better technique of learning. 

(e) Fracker. A careful piece of work by Fracker,® from the 
laboratory of the University of Iowa, in 1908, dealt with the effect 
of training in remembering the order of sound intensities upon 
efficiency in remembering the order of various other materials. 
Fracker’s results are regarded by him as furnishing clear evidence 
of transfer under certain conditions, provided ‘‘transfer’’ is 
thought of not as a mere ‘‘spread’’ of training, but as ability to 
use in a second situation a content or a form of procedure that is 
identical with the one trained. There is little spread of training 
without this identity of form or content. Ability to organize facts 
by conscious use of imagery, for example, is transferable, and it 
seems probable that it is especially in identical elements of an 
ideational sort that the carriers of the training effect are to be 
found. 

(f) Winch. In 1908, and again in 1910, W. H. Winch,’ of 
London, reported one of the first attempts to study transfer in 
children working under normal schoolroom conditions and with 
special pains to classify the subjects into two groups of equivalent 
ability. Winch’s experimental groups were trained in rote memo- 
rizing of poetry or of meaningless things and the effect was meas- 


®’Dearborn, W. F. ‘‘The general effects of special practise in memory.’’ 
Psych. Bull., 6: 1909, 44. 

°Fracker, G. C. ‘‘On the transference of training in memory.’’? Psych. 
Rev. Mon. Supp., No. 38, 1908, pp. 56-102. 

7Winch, W. H. ‘‘The transfer of improvement in memory of school- 
children.’’ British Jour. Psych., 2: 1908, 284-293, and 3: 1910, 386-405. 
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ured that this training produced upon the memorizing of other 
forms of material. He construes the results as indicative of a real 
transfer of training; however, the amount of transfer cannot be 
regarded as great when proper allowance is made for the operation 
of chance and for certain features of method and material used 
in the study. 


(g) Sleight. One year after Winch’s second paper appeared 
another English investigation, that of Sleight,? in which 12-year- 
old school children and also adults in the laboratory were given 
training in memorizing poetry, arithmetical tables, and prose sub- 
stance and were tested for the effect of this training on other 
forms of memorizing. The results give only slight ground for 
' claiming transfer. The ‘area’ affected by special training is quite 
narrow, and if training does occur through the presence of identical 
elements, the identity must be very close and must include identity 
of procedure, of imagery, and of logical organization. In particu- 
lar, the mere presence of an identical element is not sufficient to 
produce transfer; the element must also be usable by its possessor. 
Furthermore, a considerable degree of similarity between two bits 
of content may be entirely ineffective as a basis for transfer if 
there also exists an important difference in the two procedures. 
Again, there may appear actual interference with a latent tendency 
toward transfer if the second activity chances to be unpleasant 
for the subject. There is no ‘‘general memory faculty’’ and no 
generalized training of attention. 

(h) Woodrow. Quite recently there has appeared a study of 
transfer of memory training, conducted by Woodrow® upon three 
groups of university sophomores. The 106 students in the Control 
Group took only the beginning and end tests; the 34 students in 
the Practice Group were given straight practice in memorizing 
poetry between the two tests; but the 42 students in the Training 
Group had, in addition, specific instruction in the technique of 
memorizing and in applying this technique. The second group 
(practiced) were sometimes better, sometimes poorer than the first 


® Sleight, W. G. ‘‘Memory and formal training.’’? British Jour. Psych., 
4: 1911, 386-457. 

° Woodrow, H. ‘‘ The effect of type of training upon transference.’’ Jowf 
Educ. Psych., 18: March, 1927, 159-172. 
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group (control), but the third group (trained) average in the 
end tests 31.6 percent more gain than the control group. Woodrow 
says that ‘‘to determine the amount of transference of ability that 
did result from the study of a particular subject is one thing, to 
determine the amount of transfer which might be secured from the 
study of the same subject is a different and far more difficult 
thing.’’ The experiment, he thinks, shows the difference ‘‘ between 
unenlightened drill and intelligent teaching.”’ 

This study does seem to me especially interesting. It would 
seem to support the contention of those who ascribe particular 
Significance to generalization as the vehicle of transfer. What 
appears to have happened is that the merely practiced students 
failed of themselves to educe adequate general principles governing 
the technique of memorizing, whereas these were supplied by the 
instructor to the Trained Group. On the other hand, the question 
may be raised as to whether we are really dealing with a problem of 
transfer (in the usual sense) if we simply demonstrate that direct 
instruction in a technique of intellectual procedure can afterwards 
be employed by intelligent mature students in various concrete 
situations. In other words, Woodrow should have shown us what 
amount of improvement could have been secured by instruction 
but no practice. ‘ 


2. Studies Dealing with Perception, Discrimination, 
and Apprehension 


(a) Thorndike and Woodworth. In their laboratory experi- 
ments reported in 1901 these authors’® trained from four to six 
laboratory subjects in estimating small areas, weights or lines and 
in the perception of certain letters and parts of speech, then meas- 
ured the effect of this training upon the estimation of other areas, 
weights, and lines, and .the perceiving of other letters and parts 
of speech. Despite the close similarity of the training and the 
test series, the general quantitative outcome indicated a compara- 
tively slight transfer of training, and there were evidences of inter- 
ference (negative transfer) under certain conditions. The authors 


2” Thorndike, E. L., and Woodworth, R. 8. ‘‘The influence of improvement 
in one mental function upon the efficiency of other functions.’’ Psych. Rev., 
8: 1901, 247-261, 384-395, 553-564. 
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of this often-quoted study contend that there are no such “‘gen- 
eral’’ functions open to training as attention, accuracy, or power 
of observation, and that whatever transfer takes places does so by 
dint of the presence of identical elements in the influencing and 
influenced functions. 

My criticism of this contention has already been foreshadowed. 
Why expect academically trained adults to be open to further im- 
provement of such ‘‘general functions’’? 

(b) Judd. While working in the Yale laboratory, Judd" 
trained two subjects in the Miller-Lyer illusion and found that, 
by dint of practice, and without the aid of abstract judgment, the 
illusion gradually disappeared and that the practice-effect was 
transferred. The effect was ascribed to perceptual training. 


(ec) Coover and Angell. Six subjects trained in introspection 
were used by Coover and Angell” in their investigation of transfer 
in the field of sensory discrimination. The effect of training in the 
discrimination of sound intensities was found to transfer to the 
discrimination of grays, but this result is supported by assertions 
based upon introspection rather than by any clear statistical 
demonstration, since the two control subjects seemed to improve as 
much as the four experimental subjects. 

(d) Whipple, Foster,* and Dallenbach.® The experimental 
work of these three, all studies from the Cornell Educational 
Laboratory, may be presented conjointly. In all three investiga- 
tions the material consisted of drawings, numbers, lines of poetry, 
groups of objects, and the like, exposed to view very briefly. 

The work of Whipple and of Foster was limited to laboratory 
test upon competent adults. The general outcome was that there 
was no improvement of any basic intrinsic ability, like attention 
or apprehension or visualizing, that the superficial improvement 


u Judd, C. H. ‘‘Practice and its effect on the perception of illusions.’’ 
Psych. Rev., 9: 1902, 27-39. 

*Coover, J. H., and Angell, F. ‘‘General practice effect of special exer- 
cise.’?? Amer. Jour. Psych., 18: 1907, 327-340. 

* Whipple, G. M. ‘‘The effect of practice upon the range of visual atten- 
tion and of visual apprehension.’’ Jour. Hduc. Psych., 1: 1910, 249-262. 

“Foster, W. S. ‘The effect of practice upon visualizing and upon the 
reproduction of visual impressions.’’ Jour, Educ. Psych., 2:1911, 11-21. 

*Dallenbach, K. M. ‘‘The effect of practice upon visual apprehension.’’ 
Jour. Educ, Psych., 5: 1914, 321-334, 387-404. 
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was very specific and explicable as due to habituation to the setting 
of the experiment, to special ‘tricks’ or techniques rapidly acquired 
during the experimenting, to the development of confidence, ete. 

In the work of Dallenbach, however, when similar rapid-ex- 
posure practice was given to a class of third-grade school children, 
not only was there a marked practice-effect, but also evidence of the 
transfer of this effect, brought out more especially by retesting 
after 11 months in comparison with untested pupils. Dallenbach 
concluded that the training had permanently altered and modified 
certain mental traits; in other words, that such exercises as 
Whipple and Foster found to be ineffective in altering the basic 
capacities of adults were really effective in making such alteration 
in children. 


(e) Ruger. Hither in this section or the next may be classed 
the investigation conducted by Ruger?® at Columbia University. 
By employing 37 mechanical puzzles and by minute records of the 
subjects’ work, Ruger was able to observe to what extent solutions 
arrived at in given puzzles were transferred to the solution of other 
puzzles of varying degrees of similarity. The outcome of this study 
seems quite unlike that reported by Judd in the practice with illu- 
sions of length, for Ruger concluded that the presence of imagery 
was of no avail without conscious generalization of methods of 
attack and analysis. Other factors making for-transfer were ideals 
of efficiency, attitudes of ‘‘high-level attention,’’ satisfaction of 
success, ete. ‘‘The value of specific habits under a change of con- 
ditions depended directly on the presence of a general idea which 
would serve for their control.’’ 


(f) Wang. Somewhat similar to the study of Coover and 
Angell is that carried on at the University of Michigan by a Chi- 
nese student, Wang,’ who trained a small number of pupils in 
discriminating the lengths of vertical lines and tested the effect 
of this training upon their ability to discriminate pitch, to dis- 
criminate shades of color, to discriminate sizes of figures, and to 
mark words containing certain letters. Wang found that no trans- 


* Ruger, H. A. The Psychology of Efficiency. Archives of Psychology, 
No. 15. 1910. 

Wang, OC. P. The General Value of Visual Sense Training in Children. 
Edue. Psych. Monographs, No. 15, 1916. 
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fer appeared unless the children were able to develop an efficient 
method in the training series and to use this purposefully in the 
test series. 


3. Studies Dealing With Voluntary Effort and 
Motor Adjustments 


(a) Gilbert and Fracker. One of the earliest studies in this 
field is that of these two Iowa investigators,** who worked in the 
laboratory with the stock forms of reaction-time. For their three 
subjects large amounts of transfer-effect were noted; thus, prac- 
tice in simple sound reaction reduced the time for simple reaction 
to other stimuli almost as much as it reduced the time for the 
sound reaction; similarly with practice in discrimination reactions, 
but practice with the former did not affect ability in the latter. It 
must be evident that the general setting of the reaction test is so 
similar for the various types of reaction that one would expect 
such an improvement as was reported to oceur through such ex- 
ternal or indirect factors as habituation, acquiring favorable 
set, etc. 


(b) Bair. In the Columbia Laboratory, in 1902, J. H. Bair,?® 
in connection with various experiments on the practice curve, en- 
countered situations permitting a study of transfer, e.g., the effect 
that training in tapping rapidly on the typewriter a given series 
of exposed characters had upon speed of tapping other series, and 
again, the effect that training in reciting the alphabet rapidly 
under certain conditions had upon reciting it under other condi- 
tions. The general outcome. of Bair’s work was evidence for a 
definitely transferred practice-effect; ‘‘one kind of practice helps 
us in another kind.’’ Bair also reported numerous interference 
effects, and it is of interest to note that interference often lessens 
as practice continues, until it disappears and the transfer may then 
become positive and favorable to the new activity. 


(ec) Coover and Angell. In the investigation by these authors 
that has already been mentioned an experiment was included in 


% Gilbert, J. A., and Fracker, G. C. ‘‘The effects of practice in reaction 
and discrimination for sound upon the time of reaction and discrimination of 
other forms of stimuli.’? Univ. of Iowa Studies in Psych., 1: 1897, 62-76. 

* Bair, J. H. The Practice Curve. Psych. Rev. Mon. Supp., No. 19. 1902. 
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which subjects were trained for forty days in the rapid sorting 
of cards. This training was said to have definitely increased their 
efficiency in certain aspects of typing. The investigators designed 
this experiment to eliminate all identical factors and hence ascribe 
the statistical improvement to certain ‘‘general’’ factors, like 
equitable distribution of attention and development of power to 
concentrate attention throughout an entire series of reactions. This 
is one of the few studies in which a claim has been advanced for 
such training of a general function. 

(d) Wallin. The experiment of Wallin,?° in which two subjects 
were practiced for 60 days in controlling with one eye illusions of 
reversible perspective with the result that this practice was found 
to be transferred to the other eye, may be cited as a typical instance 
of the so-called ‘‘cross-transfer of training.’’ It seems obvious 
that the training occurred primarily elsewhere than in the used 
retina, so that its effect would naturally appear when the pre- 
viously unused retina was employed. The same remarks may be 
made concerning Wallin’s further observation of transfer from 
foveal to peripheral perception of these illusions. 


4. Studies Dealing With Schoolroom Activities 
and Attitudes 


(a) Bagley, Squire, and Ruediger. An experiment planned by 
Mrs. Carrie Squire and reported by Bagley”? concerned the transfer 
of a habit or ideal of neatness from one branch of school work to 
other branches. This led Ruediger?? to investigate the matter fur- 
ther a few years later. 

The Bagley-Squire work seems to have been seized upon as a 
basis for pedagogical generalizations that far exceed its warrant. 
In substance ‘‘the test consisted simply in insisting on neatness and 
accuracy in the preparation of arithmetic papers by pupils in the 
latter half of the third grade.’’ Three weeks’ persistent drill pro- 
duced definite results in these papers, but no results, save a slight 


* Wallin, J. HE. W. ‘‘Two neglected instances of the transfer of training.’’ 
Jour. Educ. Psych., 1: 1910, 168-171. 

7 Briefly in his Educative Process (1905), p. 208, and more fully in his 
Educational Values (1911), pp. 188-189. 

™Ruediger, W. C. ‘‘ The indirect improvement of mental functions through 
ideals.’? Educ. Kev., 36: 1908, 364-371. 
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loss, in improving neatness in the language or spelling papers. 
The contrast was ‘‘almost startling.’’?* Drill in specifie habits of 
neatness did not produce a general habit of neatness. 

In Ruediger’s subsequent extension of the neatness experiment 
the specific training in neatness in one school subject was supple- 
mented by talks on neatness elsewhere (in dress, in the home, etc.). 
Small gains appeared in neatness, not only within the school sub- 
ject specifically trained but also within other subjects. Ruediger 
stresses in especial the operation of conscious ideals as an important 
factor in effecting the transfer of the habit. 


(b) Winch. In addition to the investigation previously men- 
tioned, this British investigator” undertook several studies of the 
transfer of training in numerical accuracy in school children. In 
the first set of experiments, of the four studied, there was no trans- 
fer in one school, but definite transfer in the other three schools. 
The transfer here consisted of the effect of training ‘‘in working 
rule sums’’ upon subsequent accuracy in solving arithmetical prob- 
lems. In the later study, in which 36 ten-year-old boys were given 
ten practice exercises in numerical computation, this group did not 
surpass the unpracticed control group of 36 boys in subsequent 
tests of ability to solve arithmetical problems rationally (but with- 
out actual numerical calculations), that is, drill in computation 
did not transfer to skill in problem solution. Of interest in this 
investigation is Winch’s demonstration that the existence of a high 
positive correlation (0.85 in one case) between these two abilities 
does not permit us to argue to a causal relationship such that the 
improvement of one of the related abilities necessarily produces im- 
provement of the other. 


“This phrase was used by Bagley in 1905. He then said: ‘‘Careful ex- 
periments were undertaken to determine whether the habit of producing neat 
papers in arithmetic will function with reference to neat written work in other 
studies.’? But in 1911 he wrote: ‘‘Experiments undertaken in the Montana 
State Normal College seemed to substantiate these conclusions. These experi- 
ments tested very crudely the ability to transfer the results of training in 
neatness and accuracy’’ (italics mine). It is this later statement which sug- 
gests that rather too much significance may have been attached to the Bagley- 
Squire experiment. 

* Winch, W. H. ‘‘Accuracy in school children. Does improvement in 
numerical accuracy ‘transfer?’ ’’ Jour. Educ. Psych., 1: 1910, 557-589. Also 
‘¢Further work on numerical accuracy in school children. Does improvement in 
numerical accuracy transfer?’’ Jbid., 2: 1911, 262-271. 
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(c) Starch. Though carried on (presumably) in the labora- 
tory with college students, we may class among the studies of school 
activities the study reported from the University of Wisconsin by 
Starch,?® in which eight subjects were drilled for 14 days on mental 
multiplications and in which their efficiency in subsequent tests of 
various arithmetical operations and in memory span for words was 
compared with that of an unpracticed control group. There was 
“*little change for memory span for either group,’’ but ‘‘the prac- 
ticed observers showed from 20 to 40 percent more improvement 
in the arithmetical tests than the unpracticed observers.’’ In 
weighing the significance of this large transfer-effect, however, we 
should note that the abilities demanded in the test series were 
closely similar, if not at times almost identical, with those demanded 
in the drill series. 

(d) Wallin. Another school subject, that of spelling, was the 
material chosen by Wallin”® for a study of transfer of efficiency 
gained in ‘column drills’ to performance in dictated compositions. 
This investigation was made on more than a thousand pupils of the 
publie schools of Cleveland. Some three to thirteen weeks after 
several weeks of drill in column spelling, the pupils were graded 
on their spelling of 40 to 50 words employed by their teachers in 
dictation. Wallin concludes that the column drill may produce a 
positive increment of spelling efficiency, an increased facility to 
master new forms. Despite the large number of pupils employed 
in this study, the experimental conditions were not ideal to yield 
clear-cut evidence of transfer (there was, for instance, a certain 
amount of incidental teaching of spelling). 


(e) Briggs. Decidedly complicated is the contribution of 
Briggs,?7 who employed two groups of 7th-grade children for an 
investigation of the effects of formal instruction in English gram- 
mar. An extended series of 54 problems and exercises constituted 
the preliminary and final test series and the intervening drill was 
in grammar taught somewhat inductively. Supplementary trials 


* Starch, D. ‘‘Transfer of training in arithmetical operations.’’ Jour. 
Educ. Psych., 2: 1911, 306-310. 

* Wallin, J. E. W. Spelling Efficiency in Its Relation to Age, Sex, and 
Grade. Edue. Psych. Monographs, 1911. 

* Briggs, T. H. Formal English grammar as a discipline.’’ Teachers 
Coll. Record, 14: Sept., 1913. 
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of the tests were made in five public school systems, some of which 
taught grammar and some of which did not. Taking the average 
performance of the classes under test, the net outcome is negative, 
1.e., to quote Briggs: ‘‘It may safely be asserted that these par- 
ticular children, after the amount of formal grammar they had, do 
not, as measured by the means employed, show in any of the abili- 
ties tested, improvement that may be attributed to their training 
in formal grammar.’’® 


(f) Hewins. The monograph by Miss Hewins”® deals with a 
more mature group of pupils (high-school freshmen) and with a 
different mental function (training in observation). After divid- 
ing the pupils of several classes into equivalent groups on the basis 
of suitable preliminary tests, one group in each class was drilled 
on the observation of biological material. Final tests were given 
to the entire class on the observation of non-biological material. On 
the whole, the results justify the assertion that ‘‘there are certainly 
conclusive evidences of transfer.’’ 

(g) Rugg. Omitting detailed reference to Bennett’s attempt? 
by studying college marks and scores in laboratory tests to show 
that the four years of college work is effective only in increasing 
specific abilities along the lines of special interests, we may cite 
the work of Rugg*! as perhaps the most comprehensive study of the 
transfer of training among college students. Five hundred Uni- 
versity of Illinois students were tested, of whom 326, the training 
group, pursued a course in descriptive geometry. The aim of the 
investigation was to determine the extent to which the special 
training received in this course operated to increase efficiency in 
various other mental activities of different degrees of similarity to 
the activities trained. The results show that the practice in geo- 
metric visualization received in the 15 weeks’ work in descriptive 
geometry did substantially increase the efficiency of ‘‘mental 
manipulation’’ of both geometric and non-geometrie elements, and 


* The reader interested in a critique of this investigation may consult a 
review of it by Bagley, Jour. Educ. Psych., 5: 1914, 538-540. 

*® Hewins, Nellie P. The Doctrine of Formal Discipline in the Light of 
Experimental Investigation. Educ. Psych. Monographs, No. 16, 1916. 

* Bennett, C. J. C. Formal Discipline. Columbia University Contribu- 
tions to Education, 1907. 

* Rugg, H. O. The Expervmental Determination of Mental Discipline in 
School Studies. Educ. Psych. Monographs, No. 17, 1916. 
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that the degree of transfer, as would be expected, varied with the 
degree of similarity between the practise and the test activity. As 
for the agencies of transfer, these were found by Rugg to lie not 
only in the direct carrying over of specific habits, but also in better 
methods of attack, better attitudes of orientation, and an increased 
facility in manipulating many elements at the same time. 

(h) Thorndike. One of the most ambitious recent studies of 
transfer effect in the school is that reported by Thorndike®? who 
used statistical methods applied to intelligence test scores to deter- 
mine the effect of a year’s school work in various high-school sub- 
jects upon the amount of the traits measured by these test scores. 
The essentially negligible disciplinary effect thus unearthed may 
perhaps be set forth most effectively by the following quotation: 

‘‘Tf our inquiry had been carried out by a psychologist from 
Mars, who knew nothing of theories of mental discipline, and simply 
tried to answer the question ‘What are the amounts of influence 
of sex, race, age, amount of ability, and studies taken, upon the 
gain made during the year in power to think, or intellect, or what- 
ever our stock intelligence tests measure,’ he might even dismiss 
‘studies taken’ with the comment: ‘The differences are so small 
and the unreliabilities are relatively so large that this factor seems 
unimportant.’ The one causal factor which he would be sure was 
at work would be the intellect already existent. Those who have the 
most to begin with gain the most during the year. Whatever studies 
they take will seem to produce large gains in intellect.’’ 

Special mention may be made also of the conclusion that the 
physical sciences are equal, if not superior, to languages and mathe- 
matics in respect of mental discipline from any point of view. 


IV. Wuat Has Been Proved CONCERNING THE AMOUNT 
oF TRANSFER 


While by no means exhaustive, the foregoing description of 
typical investigations has been sufficient to indicate the character- 
istic methods by which the problem of transfer has been attacked 
experimentally. But the outcomes of these investigations are so 
varied that some attempt should be made to summarize their gen- 
eral trend. This I shall attempt by setting down a series of cate- 
gorical statements. 


Thorndike, E. L. ‘‘Mental discipline in high-school studies.’’? Jour. 
Educ. Psych., 15: 1924, 1-22, 83-98. 
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1. In the light of what we now know to be necessary for relia- 
ble inference, many of the experimental studies of transfer are so 
imadequate in scope and method that they afford altogether too 
shaky ground for establishing permanent conclusions. 

Thus, the subjects have been too few in number or already too 
well ‘crystallized’ mentally to expect any marked transfer of im- 
provement. The material upon which the training has been ac- 
complished has often been, I think, too ‘picayunish’ to exert 
any natural appeal. Again, there has been often no control group 
or there has been no effort to compute statistically the extent to 
which mere chance has affected the result. Yet again, the training 
has been too brief to promise any far-reaching or significant altera- 
tion of the mental efficiency of the recipient. Finally, many of the 
investigations have picked for their subjects a half-dozen graduate 
students or instructors, who, in the very nature of the case, repre- 
sent a type of mind that has been rigorously selected for having 
already attained a superior stage of mental training. No doubt 
these objections may be exaggerated, but personally, I cannot avoid 
the conclusion that a considerable part of the experimental work 
on the transfer of training might as well be thrown out of the win- 
dow bodily so far as any significant contribution may be expected 
to our knowledge of the real possibilities and limitations of 
training. 

2. There can be counted on the fingers of one hand the experi- 
mental studies that have definitely yielded the verdict: ‘‘No 
transfer.’’ 

Here might be placed, for instance, the pioneer experiment of 
James, the neatness experiment of Mrs. Squire, the correlation 
study of Bennett, and the formal grammar study of Briggs. Some 
of the studies that have been quoted as supporting this position, 
like my own study of visual apprehension, are not properly classed 
as studies of transfer. They have merely shown certain essential 
limitations in the nature of the practice improvement that occurs 
in adults when faced with the acquisition of a new set of responses. 
Thus, to deny that practice in rapid observation improves any in- 
trinsic capacity or faculty of observation itself is not to deny the 
possibility of the transfer of training. It is unfortunate that this 
interpretation has gained such currency among certain popularizers 
of the experimental literature. 
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3. Several studies in which the investigator has claimed a large 
or a very definite amount of transfer provide inadequate proof of 
this claim or deal with a type of transfer in which the shift from 
trained to untrained activity is so. slight as to be of little practical 
significance. 

In making the assertion of ‘‘inadequate proof,’’ I have in mind 
the investigations of Ebert and Meumann and of Coover and 
Angell. In making the assertion of ‘‘no practical significance’’ I 
have in mind the so-called ‘‘transfer of training’’ in the reaction- 
time tests of Gilbert and Fracker or Wallin’s ‘other eye’ transfer 
or Wallin’s transfer from spelling in a column to spelling in a 
sentence. To exaggerate to the point of grotesqueness, of what 
avail to demonstrate that having trained myself to sing a song 
while wearing a hat, I find I have improved in singing it with my 
hat off? 

4. In a preponderant number of investigations there has been 
revealed a measurable, though restricted amount of transfer. In 
general, the more similar the influencing and the influenced activi- 
ties, the more certain the transfer of efficiency and the greater its 
amount. 

Anyone who reads the literature of formal discipline in chrono- 
logical order will perceive clearly that investigators have been 
swayed at times by their own attitude toward the popular notions 
of transfer prevalent when their investigations were conceived. 
The effect of this attitude is not infrequently revealed in the man- 
ner in which the investigator has formulated his conclusions. If 
he thinks there is current a naive belief in general cultural train- 
ing, in wide-spread transfer, then a definite, though moderate 
amount of transfer in his experiments is set forth as a demonstra- 
tion of the specificity of training and the futility of belief in formal 
discipline, although it might quite as well have been heralded as a 
partial justification of the dogma. 

5. The transfer of training may impair the second activity, 
rather than improve it. It appears likely that this so-called ‘nega- 
tive’ transfer is most pronounced when the second activity 1s begun 
and that it may not only be reduced to zero, but become transformed 
into a positive transfer if the second activity is continued long 
enough. 
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This outcome is perhaps sufficiently perplexing at first view to 
warrant a simple illustration. A man well trained in operating a 
standard automobile gear-shift, when suddenly called upon to 
operate a car with reversed shift (like the Dodge, for instance), 
might, on account of interference, perform the shifting at first less 
efficiently than a novice; if he persisted, however, he would 
eventually surpass the novice, because he would already have 
learned the underlying general principles of correct gear-shifting 
and these would be ‘carried over’ advantageously, once the spe- 
cific differences between the two cars were overcome. 


V. Wuat Has BEEN PROVED CONCERNING THE AGENCIES 
OF TRANSFER 


1. The Doctrine of ‘Common Elements’ 


Referring to our schematic explanation of the problem of trans- 
fer and to the summary just made of the quantitative results of 
transfer experiments, it is evident that there can be neither com- 
plete specificity of response nor complete interlocking of all re- 
sponses, because there is commonly discovered a certain amount, 
but a restricted amount, of influence from training series to subse- 
quent test series. And wherever there is spread, or transfer, of 
improvement from one activity to another, it is certainly a fair 
presumption that the two activities—neural activities, of course, in 
the last resort—must have one or more features in common.** This 
argument is equivalent to declaring that transfer takes place 
through ‘‘identical elements.’’? If this phrase—‘‘common ele- 
ments’’—be taken in a broad sense, all the theories of transfer, so 
far as I can see, can be said to agree, that is, all explanations of 
transfer posit the operation of a common factor between the influ- 
encing and the influenced function. The point of dispute lies, 
however, in the question: What are the common elements that are 


* Poffenberger, in summarizing his study on ‘‘The influence of improve- 
ment in one simple mental process upon other related processes’? (Jour. Edue. 
Psych., 6: 1915, 459-474), concludes similarly: (1) that if there are no 
identical associative processes common to two activities, training in the first 
activity has no influence at all on the second, (2) that if there are some iden- 
tical processes, there will be positive transfer, but (3) that if the second 
activity necessitates breaking associative connections learned in the first, 
e.g., adding after multiplying, there will be negative transfer, or interference. 
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ordinarily responsible for transfer? To examine this issue, a dis- 
tinction, useful for purposes of discussion, may be drawn between 
common content and common method. 


2. Common Content 


So far as common content is concerned, this type of transfer 
may be readily illustrated if we think of ‘content’ as ‘items of in- 
formation.’ Thus, the study of Latin helps the student later with 
his French because the French is a Romance tongue and many 
French words have been carried over with slight modifications from 
the Latin. It sometimes seems as if this sort of carrying over of 
learning were too obvious or commonplace to be dignified by the 
designation ‘‘transfer.’’ I mean there hardly seems to be any real 
transfer. You perceive readily that the French terre must mean 
earth because you already know that the Latin terra means earth. 
Yet this sort of carry-over is commonly cited as a case of transfer 
of training because the terra-earth response was acquired in one 
situation (translating Latin) and now appears as an aid to the 
slightly variant terre-earth response in another situation (trans- 
lating French). Or, to use another illustration, we are discussing 
identity of content when we raise such a question as: ‘‘ Does train- 
ing in educational psychology given to prospective teachers in the 
school of education transfer to their classroom teaching?’’ because 
we mean: ‘‘When a concrete situation arises wherein a certain 
psychological precept properly applies, does the teacher recall the 
precept, see and undertake its application?’’ The same aspect of 
the general problem appears, to cite another instance, when we 
inquire whether training in the responses of the multiplication 
table transfers adequately to the use of multiplication in problem 
solving. 


8. Common Method 


If by ‘method’ we understand ‘way of going about doing some- 
thing,’ in the broadest sense of that phrase, it seems to me that | 
all transfer that does not occur from community of content must 
occur from community of method. But we find that the literature 
teems with lengthy discussions of the possibility or impossibility of 
transfer through generalized method of various sorts, which is by 


202 THE TWENTY-SEVENTH YEARBOOK 


many thought of as distinct from transfer through what they term 
“‘common procedure.’’ I judge that a few concrete illustrations 
may be helpful in setting forth this controversy by seeking what 
distinction, if any, exists between common procedure and gen- 
eralized method. 


(a) Common Procedure. Suppose that a child discovers, when 
being trained to commit to memory the outline of a map, that it 
pays him to close his eyes and attempt to see the map in his mind’s 
eye. Suppose that, subsequently, when being tested in committing 
to memory a list of words to be spelled, he employs to advantage 
the same method of closing his eyes and trying to visualize each 
word. As I understand it, most investigators would agree that in 
this case transfer of training takes place through identity of 
procedure. 


(b) Generalized Method. Suppose that the same child, while 
being trained to observe a card of letters that is flashed before his 
eyes for a fraction of a second, discovers that it pays him to con- 
centrate his attention upon a limited number of the letters and 
make sure of these, rather than to try to take in every letter that 
ig exposed. Suppose that, subsequently, when being tested in 
recalling the gist of a rather long story that is told him, he recalls 
his previous experience with the card of. letters and definitely re- 
solves to concentrate his attention upon a few fundamental fea- 
tures in the story and not try to remember the remainder. I can- 
not see that the transfer in this second case differs in principle 
from that involved in the map and spelling case, yet we find that 
the discovery and use of a ‘‘method of distributing the attention’’ 
is classed by some writers as ‘‘transfer by generalization,’’ and as 
something that differs from ‘‘identity in procedure.”’ 

Suppose, thirdly, that’ this child, though somewhat dismayed 
by the card exposure training at first, was greatly cheered by learn- 
ing that he was really doing better than his mates, so that he said 
to himself: ‘‘I can do this better than the other boys if I only 
try hard.’’ And suppose that this same encouraging thought 
stimulated his work advantageously later when he had to recall the 
story. It seems to me that we may say that, in the broad sense of 
the term, we have here again a case of transfer through ‘method,’ 
through the development of a ‘way of going about it,’ but most 
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investigators feel that this type of transfer effect must have a spe- 
cial category; it is called ‘‘transfer through the agency of a gen- 
eralized attitudinal factor.’’ Bagley, as I understand him, might 
class this illustration as ‘‘transfer through ideals.’’ 

Perhaps these three illustrations will suffice to make clear my 
point, namely: Unless there is actual partial identity of subject 
matter, or content, any transfer of training that appears is the re- 
sult of the presence of a common factor of method. 


VI. THe PEDAGOGICAL SIGNIFICANCE OF TRANSFER 
1. Conscious Versus Unconscious Transfer 


The experimental work has made it evident that efficiency ac- 
quired in training may spread to related activities without any 
deliberate intent on the part of the person trained and in fact with- 
out his knowing what increased his efficiency or that any such 
factor was operating in the related activities. Perhaps it would be 
better to term this ‘‘unintentional’’ or ‘‘passive’’ or ‘‘automatie,’’ 
rather than ‘‘unconscious’’ transfer. If it were not for this sort 
of transfer, the limitations of training would be far more serious 
than they are. 

But the experimental work has also made it evident that in 
many situations (and J think especially in transfer through com- 
munity of method) transfer is either non-existent or decidedly 
limited unless the factors that are producing improvement in the 
training operations are consciously recognized and also consciously 
applied to bring about efficiency in the related operations. This is 
what Sleight refers to, for instance, when he says that mere 
presence of an identical element is not enough; it must also be 
usable by its possessor. The pedagogical moral is obvious: Teach- 
ers should arrange the work of pupils and their own instructional 
efforts in such a way as to facilitate the conscious recognition by the 
pupils of the methods by which efficient mental work 1s done. 


2. Facts Versus Relations as Vehicles of Transfer 


Another way of bringing out the relative importance of com- 
munity of content and community of method in effecting the trans- 
fer of training is to compare the value of facts with the value of 
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relations as vehicles in the process. When two or more facts are 
being handled in thinking, it is the relation of these facts to one 
another and the similarity of this relation to other relations be- 
tween facts previously handled that furnishes the usual basis for 
whatever transfer of training occurs. It is a relation that the 
individual has educed from consideration of the first set of facts 
that enables him to handle more efficiently the second set of facts.** 

The pedagogical moral here is likewise obvious: A skillful and 
‘artistic’ teacher will take pains to see that facts are so presented 
that pupils draw out the relations between these facts and utihze 
these relations in the handling of other sets of facts in which similar 
relations obtain. 

This leads us naturally to another important pedagogical 
principle. 


3. Dependence of Transfer Upon Native Mental Ability 


If efficient transfer hinges primarily upon the conscious realiza- 
tion and conscious utilization of relations between facts and rela- 
tions between relations, it can hardly be denied that the possibilities 
of transfer are to an important degree dependent upon what we 
commonly term ‘‘general intelligence,’’ upon native mental ability 
in intellectual operations. It is not necessary at this point to drag 
the reader into the controversy concerning the fundamental nature 
of intelligence, for I am sure all those who have discussed it are 
willing to agree that superior intelligence is not primarily con- 
ditioned by superior operation of the simple processes of sensation, 
perception, or even of the processes of sensory discrimination or 
imaging or memory, but rather by the superior operation of the 
relational processes. Put in other words, the outstanding feature 
of dullness is the comparative limitation of mental processes to 
sensitivity, bare retentivity, and simple associative reproduction, 


“Some writers go so far as to say that no transfer, takes place at all from 
the bare perception or cognition of facts, but only with respect to the appre- 
hension and use of relations educed between sets of facts (see, for instance, 
J. J. Strasheim, A New Method of Mental Testing, p. 147, and authorities 
quoted by him, like G. Thomson, Instinct, Intelligence, and Character, pp. 
140 ff.). The eduction of such relations may be extended, of course, to higher 
and higher levels; there may be relations between relations, and these ‘‘ higher 
level relations’? may extend, in competent minds, to almost any level of 
generalization. 


~ 
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whereas the outstanding feature of brightness is the comparative 
predominance of the apprehension, formulation, and utilization of 
generalized relations in the intellectual life.** Methods, laws, rules, 
maxims, precepts, are embodiments of just such generalized rela- 
tions. Whenever the experimental conditions are so arranged as 
to permit the possession of these outstanding features to emerge 
(for example, in the investigation of Strasheim, just mentioned), 
the contrast between dull children and bright children is striking, 
and the dependence of transfer upon native ability is clearly 
demonstrated. 

The pedagogical moral is again obvious: Teachers may expect 
bright children to surpass dull children in the amount of transfer 
gained from specific training; moreover, in proportion as the child’s 
intelligence 1s inadequate, in that proportion must the teacher him- 
self consciously and purposefully bring the child’s attention to the 
relations to be educed and to the use of these relations subsequently. 
Even when aided by such skillful instruction, the spread of training 
wil be decidedly less in the dull. 


4. Dependence of Transfer Upon Maturity 


Experimental investigation makes it evident that, as the child 
passes from infancy to maturity physically, there occurs a roughly 
similar growth in numerous mental abilities. This growth may be 
regarded, at least from the standpoint of theory, as a simple process 
of maturing, as a development that would take place with the lapse 
of time whether there were any formal training or not.*® Accept- 
ance of this doctrine is implicit, for instance, in all our machinery 
for computing mental age and intelligence quotients. From this 
point of view, we expect that the ability to educe and apply rela- 
tions will increase with maturity from birth to some period in the 
neighborhood of 14 years, and consequently, we would expect a 
growth during childhood and youth in the extent to which specific 
training can become generalized. 


%® On the testimony of teachers to this effect, see Baker, H. J. Character- 
astic Differences in Bright and Dull Children. Bloomington, Tlinois, 1927. 

* Of course, other definitions of maturity may be defended, but this one is 
at least useful for schematizing the factors underlying mental development, as 
I have tried to show elsewhere (‘‘Hndowment, maturity, and training as fac- 
tors in intelligence scores,’’ Scientific Mo., 18: 1924, 496-507.) 
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But, if we leave the matter here, we should expect that from 
the age of adolescence to perhaps some point where the processes 
of senescence intervene, the capacity to receive training and to profit 
by its spread would remain constant and maximum. Just here, 
however, we are confronted by considerable experimental evidence 
that the spread of training in adults is not large. The solution of 
this seeming contradiction appears to me quite simple, namely: by 
the time the average individual attains mental maturity a con- 
siderable fraction of the mental operations that were originally 
susceptible to modification by training have already been modified 
and have become more or less definitely ‘set’ in a fashion that has 
turned out to be satisfactory as far as the individual’s daily needs 
are concerned. Thus, why expect training in sensory discrimination 
of college students to increase their fundamental eapacity to attend 
to small differences? They have been attending to such differences, 
so far as life necessitated it, daily for twenty years; they are 
already trained, by hours and hours of mental work, to concentrate 
attention, to disregard distractions, to formulate their experiences 
in fitting terms, and in general, to exert a reasonably efficient con- 
trol over their intellectual processes. No, the place to look for strik- 
ing transfer-effects is in the immature. To my mind, this argu- 
ment is well illustrated by the marked difference in outcome be- 
tween the experiments of Dallenbach and those of Foster and 
myself. Neither Foster nor I could discern any marked intrinsic 
improvement in adults (mostly highly selected adults) from our 
training in ‘observation’ under conditions of rapid exposure, but 
Dallenbach found good evidence of a fundamental and permanent 
improvement in this function when he applied an almost identical 
process of training to third-grade school children. 

From these considerations comes the conclusion: The possibili- 
ties of the transfer of training should be found at their maximum 
in children, whose mental activities are still in the formative stage. 
While adults possess a much higher intrinsic capacity than do chil- 
dren to effect the eduction, the conceptuahization, and the applica- 
tions of relationships, this capacity is less evident in practice simply 
because the vast majority of these relations have already been 
educed, conceptualized, and applied under the stress of the extended 
daily experience and training they have already had. 
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5. Obscurity of the Higher Relations | 


Transfer of training through a common content, as we have 
already pointed out, is so obvious as hardly to merit the name of 
transfer. But transfer through community of method, when 
‘method’ has reference to the operation of relations between rela- 
tions, of the ‘higher level relations,’ is often exceedingly obscure. 
Right here is, indeed, the basis for the never-ending arguments 
about the cultural value of the classics and other branches of sub- 
ject matter that have long been favored for their merits as pro- 
ducers of ‘formal discipline.’ Take the case of the advocacy of 
mathematics as a trainer of reasoning. The experimentalist who 
brushes this claim aside with the assertion that ‘‘all training is spe- 
cific’? and that ‘‘there is no such faculty as reasoning’’ must at 
times feel a qualm of conscience and wonder if that line of retort 
may not, after all, ‘‘pour out the baby with the bath.’’ Take the 
case of the pleader for Latin who declares that drill in Latin gram- 
mar teaches the student ‘‘the importance of little things’’ (mean- 
ing on account of case endings and the like). I hold no brief for 
Latin, nor would I ever advocate the study of its grammar as a 
valuable way to learn the importance of the seemingly trivial, but 
I am concerned to point out that what this pleader declares to 
happen really could happen. I mean that it is within the bounds of 
possibility that a Latin student should educe a generalization from 
this study that he never chanced to educe before—‘‘I must pay 
more attention to little things; they sometimes make all the differ- 
ence in the world’’—and within the bounds of possibility that this 
student should remind himself of his generalization ten years later 
when he was called upon to judge the merits of rival candidates 
for a political office. In other words, reverting to the phraseology 
of my introductory paragraphs, no one can assert positively that 
the training received in acquiring Response 34 may not help years 
later in effecting Response 635,782, if 34 and 635,782 happen to 
have some community (even a very tenuous one) in X. But, of 
course, it is equally true that to locate and isolate X and to prove 
that it was common to both Response 34 and Response 635,782 and 
further to prove that it was on account of this community and 
by means of it that the latter response was rendered more efficient 
is, indeed, exceedingly difficult. Let the reader consider some one 
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ability of which he is a bit proud, and let him ask himself just 
where in his early training, in what classroom, by what teacher, in 
what lesson, or in what experience or series of experiences was 
laid the foundation of this ability; if he is candid, he will almost 
certainly have to answer: ‘‘I don’t know.’’ If we are to make 
headway toward a final formulation of transfer, future investiga- 
tions will need to be directed toward the careful analysis of these 
obscure, higher-level intellectual operations, the genesis and func- 
tion of various attitudes and ideals, the development and use of the 
larger generalizations of method—an exceedingly difficult field for 
study. 

My argument here may be stated, therefore, more briefly: It 1s 
not only probable, but fairly certain, that some of the most im- 
portant agencies of transfer are to be found among the higher-level 
relations, in generalized attitudes, moods, ideals, sets, ways of going 
about mental operations generally. These agencies are in the nature 
of the case obscure; the nature and extent of their participation has 
occasioned extended argument, largely futile because so little rs 
known factually. Further study is much needed, but difficult. In 
the meantime the educator is not justified in resorting to any spe- 
cific subject of wmstruction for the purpose primarily of derwing 
from it indirect training values, but neither rs he justified in 
neglecting to derive from every subject all the training value that 
it seems to promise. 


6. Moral Training 


The experimental study of the transfer of training has dealt 
almost entirely, as we have shown, with comparatively simple 
mental operations of the intellectual type, isolated for laboratory 
control and inspection, as, for instance, with rote memory, pitch 
discrimination, speed of adding, estimation of size, ete. Neverthe- 
less, in discussing the agencies of transfer we find mention made 
in these studies, as well as in studies of more complex schoolroom 
activities, of the operations of certain general factors, such as con- 
fidence, persistence, ideals of accuracy, ideals of neatness, that may 
be regarded as quasi-moral, if not moral traits. The claims made by 
proponents of the cultural merits of the classics likewise may -be 
regarded as evidence of a belief—though not, of course, a demon- 
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stration—that a transfer of training exists within the realm of 
moral development. We find on all sides this conviction that moral 
training is gained by transfer. The playground inculcates the 
spirit of ‘‘fair play.’? How? Surely, if at all, by a transfer of 
training just as real as the transfer of training between any purely 
intellectual activities. The point I wish to make is that a very con- 
siderable reliance is placed upon the indirect approach in the de- 
velopment of character. I am almost tempted to say that an un- 
questioned acceptance of the doctrine of formal discipline, of the 
thoroughgoing efficacy of transfer of training, is a conspicuous 
feature of plans for moral training. Opinions will differ as to the 
strength of this foundation for moral development; to some psy- 
chologists at least, transfer seems an uncertain and insecure reli- 
ance for this highly important aspect of education. Whatever be 
the merits of the case, it is certain that we know less about the 
amount of transfer and the agencies of transfer in the realm of 
moral, than in that of intellectual training, despite the fact that 
from a practical point of view it is just in the realm of moral 
training that we should profit most by full knowledge of the possi- 
bilities and limitations of training. To conclude this final comment 
on the pedagogical significance of transfer: 

It is particularly in the field of moral education that there has 
prevailed a general and uncritical acceptance of the dogma of for- 
mal discipline. While it 1s probably true that through the agency 
of the higher-level relations there may occur an important develop- 
ment of certain quasi-moral attitudinal traits, there has been no 
clear experimental demonstration of this transfer and it remains 
precisely in this important field of moral education that we know 
least about the possibilities or the limitations of training. 


PREFATORY NOTE TO CHAPTER XIV 


When mental tests first were used, the opinion was frequently 
expressed that the large individual differences brought to light were 
perhaps due chiefly to the unequal familiarity of the examinees with 
the tasks set, and that among examinees who had been practiced 
well toward their limit these differences would become small or 
would disappear. In that event mental-test scores of unprac- 
tised subjects would lose their significance entirely. If individual 
differences can be eliminated by training, then they are presumably 
largely due to environmental factors. If they persist, in spite of 
equalized training, they are presumably to be attributed to nature 
rather than nurture. That is the problem with which the follow- 
ing chapter is concerned. 

The authors, however, have not investigated the effect of prac- 
tice upon individual differences in mental-test scores, but instead, 
its effect upon individual differences in initial learning scores. It 
is probable that whatever holds true for the latter comparison holds 
also for the former. If different subjects do not maintain much the 
same relative position in initial and final stages of practice in 
mental or psycho-motor performances, a serious disturbing factor 
enters into all our intelligence and personality tests. 

The issue is complicated by the probability that what holds for 
one kind of test or learning material may not hold for another kind. 
It may be that practice decreases individual differences in simple 
psychomotor performances, but increases them in performances 
which tax the higher mental processes. Extensive investigations 
will be necessary to determine the exact facts. The data presented 
in this chapter are interesting, but are not regarded by the authors 
as at all conclusive. The most important result is that, in digit- 
symbol substitution, ecard-sorting, mental multiplication, and 
achievement-test performances, the examineers tend to keep much 
their same relative position at different stages of practice. The 
individual differences here investigated are not greatly altered by 
equalized training. 
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CHAPTER XIV 


THE EFFECTS OF PRACTICE ON INDIVIDUAL 
DIFFERENCES? 


JOSEPH PETERSON 
George Peabody College for Teachers, Nashville, Tennessee 
AND 
Myron Catt BARLOW 
University of Utah, Salt Lake City, Utah 


NATURE OF THE PROBLEM 


Does practice in any mental function or any specific kind of 
performance increase or decrease the individual differences present 
at the beginning of the practice? Again, do the individuals that 
are least efficient at the beginning tend to be least efficient through- 
out the practice or are there frequent changes in relative rank in 
efficiency during the practice or other similar training? These are 
questions of considerable importance both to educational work and 
to vocational direction, and they have received a fair amount of 
attention by experimental psychologists. But the first question is 
yet in the main to be answered. 

As to the answer to the second problem as here stated there is 
now rather general agreement. Numerous investigators, concerned 
with various kinds of performances, have obtained marked or high 
correlations between different pairs of rankings of their subjects 
in different stages of practice. Most of the results in this problem 
have been conveniently tabulated in a recent article by Kincaid? 
and need not be reproduced here. They show convincingly that in 
various kinds of learning and for both children and adults, and 
both feeble-minded and superior individuals, correlations between 
initial and final ability or between ability at any other two measured 


1Mr. Peterson is wholly responsible for the point of view here taken as 
well as for the hypotheses and conclusions presented. He has written the 
article. Mr. Barlow’s work in carrying out the experimentation attributed to 
him in the text was so great that it has been considered fair to make him joint 
author. Grateful appreciation is here expressed to Dr. 8. C. Garrison and to 
Professor Carl L. Adams. 

2Kincaid, Margaret, ‘‘A study of individual differences in learning.’’ 
Psych. Rev., 32: 1925, 34-53. 
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stages are positive and generally rather high. This means that, with 
the possible exception of persons starting out with very unequal 
practice, comparatively few individuals change markedly their rela- 
tive ranks throughout the learning (if, of course, the homogeneity of 
the group is not tco great). The results given agree with data from 
rather extensive experiments by Pyle, Peterson, and Race, with 
learning processes ranging from physical performances, like ball- 
tossing, marble-sorting, and card-sorting, through a variety of 
activities to the more specifically mental operations of sentence 
completion, calculations, and answering questions on facts in his- 
tory.2 Miss Race concludes her monograph by saying: ‘‘In the 
relation of improvability to initial ability in the function itself, 
we find that the measure of initial ability is a measure and a 
prophecy of improvement, that native capacity in any function 
determines what shall be accomplished by that function’’ (p. 55)— 
a conclusion which, though it may hold for the functions investi- 
gated, may or may not apply to ‘‘any function’’ whatsoever. 

The degree of correlation between rankings of subjects in dif- 
ferent stages of practice on a given performance depends, of course, 
on the complexity of the mental functions involved, on the period 
elapsing between the two practice stages chosen for correlation, 
on the relative extents of practice in the particular performance 
studied at the beginning of the experiment, and on other such con- 
ditions. It is obvious, to take an example from the proximity of 
the two practice stages chosen, that higher correlations would be 
obtained between the forty-ninth and the fiftieth practices than be- 
tween the first and the fiftieth or the twentieth and the fortieth, or 
even between the first and the second, because at the beginning of 
practice there is more unevenness in exercise on the particular per- 
formance than at the final stages of practice. 

But these positive, and usually high, correlations between the 
ranks of the different subjects in various stages of learning do not 
necessarily show, as some investigators have supposed, that the sub- 
jects are becoming more alike with practice. They might be either 


* Pyle, W. H., The Psychology of Learning, 1921, 184 ff.; Peterson, Joseph, 
“«Eixperiments in ball-tossing: the significance of learning curves.’’ J. Haper. 
Psych., 2: 1917, 178-224; and Race, Henrietta V., Improvability: Its Interre- 
lations and Its Relations to Initial Ability. Teachers College, Columbia Uni- 
versity, Cont. to Educ., 1922, No. 124. 


THE EFFECTS OF PRACTICE 213 


growing more or growing less like one another and still maintain 
the same rank-order. Of course, some of their learning curves 
might cross and still permit a fairly high correlation between their 
ranks. Moreover, the degree of homogeneity of the group of indi- 
viduals studied will greatly affect the correlations obtained; the 
crossings in learning curves may be chiefly due to inter-changes 
of positions of subjects who are natively nearly equal, while sub- 
jects differing more innately may not change their relative positions 
at all. This latter condition seems to obtain in some of our own 
unpublished results with substitution tests. 

If we put the question as to whether practice reduces indi- 
vidual differences in this form: ‘‘Do individuals of the highest 
mental levels in general show the greatest improvability under 
practice?’’ we get discordant answers from different investigators. 
This divergence of opinion is partly due to the fact that quite 
different methods of measuring progress are used—some of them 
wholly unjustifiable methods. The problem is really not a simple 
one, and we need not be surprised at the use of different methods 
of measurement of improvement. 

A few examples may reveal the nature of the difficulties in- 
volved. Suppose that, in typewriting, Subject A with an Alpha 
(intelligence) score of 180 improves from 40 to 55 words per 
minute, while B with an Alpha score of 170 improves from 60 to 
80 words per minute. If we also put these scores into another form, 
certainly as acceptable as words per minute, say number of minutes 
to write a page of 360 words, we get the data shown in Table I. 


Tasie 1.—Dirrerent Metuops or Mrasurina IMPROVEMENT 


Subject Re Initial Test Score | Final Test Score Pe ieee 
A 130 | 40 words per min..| 55 words per min..| 15 words.| 37.5 
B 170 | 60 words per min..| 80 words per min..} 20 words.| 33.3 
A 130 | 9.00 min. to page .| 6.55 min. to page .| 2.45 min.| 27.2 
B 170 | 6.00 min. to page .| 4.50 min. to page .| 1.50 min.| 25.0 


Which person, according to Table I, makes the greater gain or 
improvement? What is the percentage of improvement of each 
subject? It has been the practice of some writers to say in such 
a case that A makes less absolute gain, and that therefore (A here 
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being representative of subjects with low intelligence in general, 
Since our present figures are only illustrative) subjects with the 
most intelligence gain most. Other writers, having their data 
couched in the second form, have maintained that subjects with 
least intelligence (as measured by some particular intelligence test) 
make most progress. Moreover, the percentage of improvement (as 
the reader will readily see from our figures in the table, and as will 
become more clear if he imagines slightly different absolute scores 
and figures) often contradicts the conclusion reached by a study 
of the absolute ‘improvement’ or ‘gain’ (whichever is preferred: 
they are not the same). And so the confusion grows with the 
accumulation of new data as long as we are not in agreement on 
the method to be used. 

The confusion resulting from comparing absolute gain in 
amount of work done per unit of time with amount of tume required 
to do a given piece of work, I clearly pointed out and illustrated 
copiously with graphs ten years ago,* but the warning then sounded 
and later emphasized® has been heeded by only a few investigators. 
The confusion still exists in most recent studies. This is due to the 
fact that the data yielded by any learning process may take either 
of two forms: (1) average attainment, or accomplishment, per unit 
of time (or per given-number of trials), and (2) time period (or 
number of errors) for the accomplishment of a given unit of work. 
Those investigators who put their data into the first form—average 
attainment—generally find that individual differences increase with 
practice. This is due to the fact that since the means are very 
small in the beginning the additions to them (and to the several 
scores) brought about as the result of practice are at first, abso- 
lutely, very small, however large the percentage gain may be. 
Thus, in ball-tossing, if a group starts with an average of two 
eatches per trial, the absolute gains will, for several trials, be only 
fractions of one unit, whereas when the average number of catches 
reaches over 40 per trial, the ensuing gains in successive trials will 
be about ten units. On the other hand, those who adopt time-units 
for doing a given amount of work will begin with large units which 


“Peterson, J., Op. cit. 

° Peterson, J., ‘‘Thurstone’s measures of variability in learning.’’ Psych. 
Bull., 15: 1918, 452-455; also Peterson, J., ‘‘Johnson’s measurement of rate of 
improvement under practice.’’? Jour. Educ. Psych., 15; 1924, 271-275. 
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show considerable absolute decrease from trial to trial, and these 
unit changes will regularly decrease in amount until they finally 
become very small. In a study of ball-tossing the average number 
_of catches per trial was 2.0 for the first practice period of 200 

catches and 2.3 for the second period—a gain of only .8; but the 
average number of errors for the first practice period was 102.5 
and 84.5 for the second,— an absolute reduction of 18. The meas- 
ures of variability—standard deviation or quartile—follow the 
same law of change, and therefore no measurement of increase or 
decrease in individual differences with practice can be based on 
absolute units, however these are expressed. 

In view of this fact; a percentage variability measure is to be 
recommended,—standard deviation divided by the mean, or quar- 
tile divided by the median. Although technical objections, which 
need not receive attention here, may also be given to the use of this 
method, nevertheless, it seems to be the most serviceable and the 
least deceiving of methods now in use.® It is also advisable to make 
careful studies of the scores of a few individuals occupying extreme 
positions in a group, since it is possible for such individuals to 
diverge with practice, even though the great majority of the mem- 
bers of the group converge and the coefficient of variability gradu- 
ally decreases. The possibility of this misinterpretation of extreme 
cases is particularly to be kept in mind when the method of divid- 
ing quartiles by medians is used, because by this method the upper 
and lower twenty-five percents are disregarded. When a suffi- 
ciently large number of subjects is used, a statistical check on the 
possibility of such a misinterpretation is to divide half of the 
difference between the 90th and the 10th percentiles (or even be- 
tween the 95th and the 5th) by the median.” If this measure of 


*The method of giving percentage gain from an initial score, although 
frequently used, is very erroneous and deceiving. When applied to the two 
kinds of units mentioned above, it gives a different result for each method, for 
reasons which become obyious from our illustration of A’s and B’s scores in 
Table I. 

* While the semi-interquartile range (Q) divided by the median does not 
take account of the changes outside the middle fifty percent of the group, as 
does the coefficient derived from dividing the o« by the mean, a number of 
calculations by these two methods show that the two measures agree from 
group to group very closely—closely enough for the rough determinations of 
the present study. Kelley has shown (‘‘A new measure of dispersion,’’ 
Quart. Pub. Am. Stat. Assoc., 17: 1921, 743-749) that the most reliable inter- 
percentile range is approximately 93P-7P, and that this is but very slightly 
inferior in reliability to the more convenient 90P-10P, which we have used 


216 THE TWENTY-SEVENTH YEARBOOK 


variability increases while the quotient of the quartile over the 
median decreases, we have a clear indication that the extreme cases 
are changing somewhat differently from cases near the central 
tendency. 

Miss Kincaid, in the article referred to, has shown that, treated 
from the standpoint of changes in variability coefficients, most pub- 
lished results on the problem at issue here actually show that indi- 
viduals become more alike under practice. The behavior of ex- 
treme cases was, however, not considered. One of the present 
writers (Peterson) had much earlier come to the same conclusion 
with respect to two investigations studied in some detail,® and a 
more recent study,® comprising a statistical examination of nu- 
merous previous results as well as some original experimentation, 
had also found, before Miss Kineaid’s article was published, that 
in most investigations yet carried out practice has tended to reduce 
individual differences. ; 


SUMMARY OF PREVIOUS INVESTIGATIONS 


A brief summary of the results of previous investigations is all 
that is necessary here. The results given in Table II have in most 
eases been worked out by us or by Miss Kincaid into variability 
coefficients from the original data as published by the several 
authors.?° 


to throw light on the variability (divergence or convergence) changes in the 
near extremes of the distributions. The latter measure is more convenient 
because of the inclusion of the 10th and 90th percentiles in the usual per- 
centile tables. Our results show that occasionally (90P-10P)/Md may show 
divergence or convergence not reflected by the Q/Md coefficient, or even a 
tendency that is the opposite of one indicated by Q/Md. The P.E. of the 
percentile* range between the 10th and the 90th percentile is .600D/Vn, 
D standing for the percentile range and nm for the number of cases. From the 
median and the two percentiles in question the skewness may be directly found. 
There is no skewness if the difference between the average of the 90th and 
10th percentiles and the median is approximately zero, or if this difference is 
small in relation to its P.E., the value of which is .40412 D/yn. Also, for 
mesokurtic distributions Q = .26315 D. If Q/D is less than .26315, the distribu- 
tion is peaked, or leptokurtic; if greater than .26315, it is flat-topped, or 
platykurtic. These specific determinations, however, are of less value to our 
present problem than the noting of changes in kurtosis with practice. The P.E. 
of Q/D is .18736/Vn. (Kelley, op. cit.) 

*See footnote 5. 

® Master’s thesis by M. C. Barlow, Individual Differences as Affected by 
Continued Practice, 1924, in the library of George Peabody College for 
Teachers. 

* For formula for P.E. of the coefficient of variability, see Reitz, Hand- 
book of Mathematical Statistics, p. 77. 
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The quantitative results in terms of variability coefficients in 
the table would vary slightly when ealeulated by different methods. 
For instance, the coefficients of an average score of the first five 
trials, say, as compared with those of the last five would give 
slightly different (and more reliable) results than would coefficients 
of the initial and final single trials; moreover the variability 
measures obtained by dividing the mean variation or the standard 
deviation by the mean (m.v. or ¢/m) will be somewhat different 
from those resulting from dividing the quartile by the median 
(Q/md) since the former are determined by the scores of all the 
individuals while the latter are directly influenced by the scores of 
only a few individuals and are not a function of extreme variations. 
The percentage variability is also influenced by the stages of the 
practice at which measurements are taken. The degree of homo- 
geneity of the subjects’ abilities, ages of subjects, and many other 
factors also have their effects both on the size and the changes of 
variability coefficients, and must ultimately have much more care- 
ful attention than they have thus far received. 


INVESTIGATIONS BY THE WRITERS 
1. <A Preliminary Experiment 


Some time ago we began at Peabody College some preliminary 
investigations of the influence of practice on individual differences. 
Having reason to believe that the practice effect would differ with 
the degree of complexity, we selected for practice a number of 
different performances which were supposed to represent different 
degrees of complexity. The experiments were all carried out under 
conditions so well controlled that the experimenter (Mr. Barlow) 
could assure himself that each subject worked at approximately his 
maximal ability. They concerned practice in (1) digit-symbol sub- 
stitution (material of Pintner and others), (2) card-sorting, and 
(3) mental multiplication. In the first exercise each subject made 
200 substitutions in each of six consecutive days, and marked his 
position at the end of each minute, so that his speed could be 
determined. In the second exercise each subject distributed 120 
eards daily for six days. The cards were distributed into an up- 
right box 30 by 48 inches and 6 inches deep, divided into forty 
compartments—five rows of eight in each row, each compartment 
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being approximately five inches square. The rows were called 
A, B, C, D, and BH, in order, and the holes in each one were num- 
bered from 1 to 8, but no labels were put on the box; the subject 
had to discover and then remember the letter and number of each 
hole. To make the problem clear, each subject was given a fore- 
exercise consisting of the distribution of five cards. In the third 
exercise, the multiplication consisted of computing mentally the 
products of two-digit numbers by two-digit numbers. Each sub- 
ject completed twelve sheets of 40 examples each, working at periods 
of twenty minutes daily till the work was completed. The periods 
ranged from twelve to twenty. The score was the number of sec- 
onds required to work forty problems, as determined from the 
twenty-minute records of problems and part-problems completed. 
The percentage of errors in multiplication and in substitution was 
tabulated, but did not figure in the score. In the eard-sorting the 
errors were noted, but not corrected. The subjects were first given 
a little practice in mental multiplication, after seeing the method 
illustrated by the experimenter before the group; but each sub- 
ject was left free to use the method most easy to himself. The 
card-sorting exercises were conducted individually; the other exer- 
cises, In groups. 

The subjects in these experiments were 20 twelve-year-old chil- 
dren in the sixth-grade of the Peabody Demonstration School. 
Their intelligence had been determined by the Stanford Binet and 
three different forms of the National Intelligence Test. The mental 
ages of the children as determined by the four intelligence tests, 
each given equal weight, were: median, 154 months; Q,, 148.3 
months; Q,, 177.5 months. 

The average of all the correlations between the different pairs 
of the intelligence tests was .61, while that of the correlations be- 
tween successive performances (on different days) in the practice 
exercises were as follows: .87 for digit-symbol, .87 for card- 
sorting, and .86 for mental multiplication. Correlations of initial 
and final practice results are interesting. Initial digit-symbol and 
eard-sorting correlation coefficient is .79; the final correlation, .24. 
Initial digit-symbol and mental multiplication is .12; the final, .16. 
Initial card-sorting and mental multiplication is —.04; the final, .67.~ 

Now we come to the significant question: Do these children 
become more alike or more unlike with practice in the functions 
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involved in our practice exercises? The coefficients of variability, 
a/m, for six successive days of 200 substitutions each are: .16, .14, 
19, 17, .18, and .18; average of first half, .16, of last half, .18. 
The six days of card-sorting for the 20 subjects gave coefficients 
of variability in order as follows: .19, 14, .11, .11, .13, and .12; 
average of first three, .15; of last three, .12. The mental multipli- 
cation exercises gave coefficients for successive groups of 80 prob- 
lems of .26, .30, .30, .31, .83, and .36; average of first three, .29; 
of last three, .33. Our results, then, show a divergence in the indi- 
vidual abilities with practice in substitution and in mental multiph- 
cation, but a convergence in card-sorting. The coefficients for suc- 
cessive sixths of the entire practice, however, are consistent only 
in the case of mental multiplication. 


2. Another Experiment 


Our next results are from a more extensive experiment in 
mental multiplication on 101 subjects, but owing to irregularity 
in the attendance of five, we include the scores of only 96 in this 
report. These 96 students had a median age of 175.9 months; 
Q, and Q, were 166.5 and 182.5 months, respectively. They were 
distributed in grades as follows: 7th, 10; 8th, 37; 9th, 23; 10th, 
23; 11th, 2; and 12th, 1. The practice experiments were admin- 
istered by one of the writers (Barlow) and an assistant, under prac- 
tically the same conditions described above for mental multiplica- 
tion, except that in this case 63 examples, instead of 40, were given 
on each sheet, and the time recorded was that needed by each sub- 
ject to complete a sheet. The rapid students were allowed to work 
more groups of problems than did the slow ones. The results 
(Table III) have therefore been tabulated in four groups for care- 
ful study by one of the writers (Peterson). The groups are as 
follows: A, the 42 pupils who worked through 13 units of 126 
problems each; B, the 24 pupils who completed 10 to 12 units; C, 
the 30 pupils who completed fewer than ten, but as many as 7 units. 
C is, of course, the slowest group. Finally we give in Group D 
the data for all 96 cases through the first 7 units of work covered 
by all the subjects. Coefficients of variability are figured both 
from the quartile and from the 90- and the 10-percentile scores, 
since the former method shows the variability of only the middle 
fifty percent of the group. 
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The ratio of the average of the first six coefficients of Group A 
to that of the last six is .68, showing a convergence of the middle 
fifty percent with practice. The ratio of the corresponding aver- 


ages of the coefficients derived from the 90th and 10th percentile, 


90P:—10RM : 
OMA .86, revealing a less marked convergence among the 


extreme half of the group. This fact is perhaps partly explained 
when one notes that there is actually a divergence with practice in 
the individuals of our Group B, of lesser ability in mental multi- 
plication; the corresponding ratios in this group are 1.28 and 1.14. 
For the more able students the process of mental multiplication 
is simple enough for the members to become more alike with prac- 
tice, whereas for the less able group it is too complex for the exer- 
cise thus to overcome individual differences. But we cannot be 
sure of this interpretation, for Group C shows no appreciable 
change in variability one way or the other. The large group, D, 
however, shows, in conformity with results of most previous in- 
vestigations, that our group on the whole becomes more alike in 
mental multiplication with practice. 

Our own experiments thus far presented are of the most valu- 
able sort because the measurements were derived directly from the 
practices themselves. If one uses any extraneous test of the practice 
effect, one is always liable to the error of using tests at the different 
stages of training which are not fully and equally representative of 
the changing functions studied, and also to the error of employing 
tests not equally fair in degree of difficulty at the different stages 
of practice. Such tests will not give equally representative scatter 
of the subjects at the different stages of training. For instance, 
either a too easy test or a test that does not give psychologically 
equal increments in degree of difficulty to successively higher per- 
centiles of the group may ‘bunch’ the subjects relatively too much 
at certain stages, even though the subjects actually do not con- 
verge or diverge with practice. Even the use of the same test-scale 
for the different stages of the practice is not a guarantee against 
the possibity of the sort of errors mentioned. However, we shall 
now present results of practice tested in some cases by standardized 
tests and in others by the conventional content examinations. 
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38. Results from Achievement Tests 


Four successive groups of children have passed through 
Grades V to VIII, inclusive, in Peabody Demonstration School, 
taking precisely the same achievement tests since the beginning of 
the fifth grade in 1919. The only difference’in the curriculum of 
these four successive groups through the four grades mentioned 
was the addition of two hours a week of home science to the 
eighth-grade programs of the last two groups. The following 
achievement tests were given to each group in every grade at as 
nearly as possible the same time of year (January): The Ayres 
Spelling Scale (twenty words selected from the column marked 
‘73%’? for the grade), The Nassau County Supplement to the 
Hillegas Composition Scale, the Woody-McCall Mixed Funda- 
mentals arithmetic test, and the Monroe Reasoning Test in Arith- 
metic. These several tests, as well as intelligence tests, were admin- 
istered either by, or immediately under the direction of, Dr. S. C. 


Taste 1V.—Meran Scores AND VARIABILITY CoprriciENnTs oF 94 CHILDREN 
TuroucH Grapes V To VIII 


Grades 
Test HA oe 
5 6 7 8 
Spelling (Ayres).......... INFGANS Sa, ele ed ae 15.8 15.7 14.7 1535 
MEV Merce rcceisss NY, .19 .22 17 |No change 
90P-10P 25 32 37 25 
2M 
Composition (Hillegas)....] Mean............ 4.0 4.7 5.0 5.8 
VIVE SNA vereienste tence .16 .18 .15 .14 | Convergence 
90P-10P .25 23 22 23 
2M 
Mixed Fundamentals...... Meant tesretere ethers 22.3 25.9 29.1 29.0 
(Woody McCall) Nae Ny Ena ec ohare al fe .13 .08 -08 |Convergence 
90P-10P 227 19 12 14 
2M 
Arithmetic Reasoning— IMiearitetheientssiciste 15.1 12.5 16.8 8.9 
Accuracy (Monroe)..... Min /Micmctese 1-2 .26 27 .23 .29 |No change 
90P-10P .40 40 39 51 
2M 
Arithmetic Reasoning— IMUGaIN TA eysye c aielesele> 24.5 18.1 22.6 | 18.5 
Principle (Monroe)...... WieWiay/Vdia ons ratsioteye .25 .21 .19 -23 | Convergence 


90P-10P 43 138 .29 .38 (slight) 
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Garrison. The responses were scored by students under his super- 
vision. Scoring and tabulation were carefully checked and re- 
checked to eliminate errors. We are indebted to Dr. Garrison for 
the use of these data, and immediately to Professor Carl L. Adams 
of East Carolina Teachers College for furnishing us with the means 
and mean deviations of these tests, from which we have calculated 
the coefficients of variability. The children were above normal in in- 
telligence, averaging slightly over 107 I.Q., with a first and ninth 
decile of 92 and 132 I1.Q., respectively. In all, 110 children were 
included in the four groups, but we have complete data for only 
94, distributed in the chronological order of the groups according 
to sex as follows: girls, 15, 18, 10, 7, total, 50; boys, 7, 10, 13, 14, 
total, 44. It is to be noted that the girls decrease (with one ex- 
ception), and the boys increase, through successive groups. Table 
IV gives the relevant data for our present purpose. 


TaBLeE V.—MEDIANS AND COEFFICIENTS OF VARIABILITY OF SCORES IN 
SuccEssivE ComMPETITIVE EXAMINATIONS AND IN CERTAIN 
INTELLIGENCE TESTs By 224 STUDENTS IN 
GENERAL PsycHOLOGY 


Examination Tests 
Section 
Aver Atkin- 
1 2 3 4 5 6 i 8 9 | 10 | age |Alpha] Otis son 
Number of Cases 
Are untetais stste oetete 55 158 158/55 j6l) (61 |61 }60' (60° 462 62 BQHe yasietec|l hyatatetere 
Bay shaecclate w-atacstiecs 52 |52 |52 |51 {60 |50 |49 |49 {51 wale a | De 53 49 53 
( CA UOC TORE 46 |47 |46 |48 |49 |/41 |46 /49 |....J.... 49 AO?) We cateciellsonthrane 
10 BAGO s ae mers 49 |45 {50 |51 |51 |49 |50 {51 {52 [53 60 60 SOM eee, 
Median Scores* 
Ae o eie's winispeuenare = 17.3]12.9]12.9}11.3)13.3/18.2]/14.1]11.5)18.1)14.2) 14.2) 125.0]......]...... 
Bis cisiere ansvetbtersis 14.8]19.0]16.8]16. 7)17.4/19.8/16.5)16.3/19.2)....] 17.5) 129.4] 64.5] 13.3 
Cigvatsiere stnteeyorcters 17.0)11.8]19.4]10.8/18.2)11.8]13.0]/18.1)....].... ne iW 58 WA acer cl Niton ON 
DD) Sista varace tu leieere 13.4]13.8]12.0]12.5]14.3/21.3]19.6]16.7]16.0]14.3] 14.4] 130.0] 58.5)...... 
Coefficients of Variability (Q/Md) 
AL taveli she siepseeemals ~29) .22) .21) .39] .22) .14) .19] .24| .14] .17] .13 SPUAL IS ayedaiedey Ilevpttta mete 
Sila retetelersteloteisiors OD} soa) +22) 1.2219.18) .2h)) 28) 29). 10s a 218 aie. .09 .19 
(Oa ecasaocdsOn ceo) «ee LO}. 26h 201.20). 20) LE a. alee Pe Riba sree: acnilla Ronde’ whe 
iS eS See mars Os esOoleeoh ple) 220) s25i Lol. 20lve2dh. 206.49 Pi 2 nO illenet a ote 


*Correlations between medians and quartiles, A, —.52; B, —.20; C, —.24; D, .62 


Table V presents the number of cases, the medians, and the 
variability coefficients of a series of successive competitive exam- 
inations in general psychology and of certain intelligence tests and 


226 THE TWENTY-SEVENTH YEARBOOK - 


one (Atkinson) ingenuity test. The results from four sections in 
psychology—A, fall of 1923; B, fall of 1924; OC, spring of 1925; 
and D, spring of 1926—are given, involving the records of 224 stu- 
dents in Peabody College, mostly sophomores. The sections were 
all taught by one of the writers (Peterson), and the examinations, 
each with a maximum of 30 points, were prepared by him and 
scored by an assistant. The papers were regularly returned at the 
next recitation following the examination, ranked in the order of 
the number of points made. By this means a healthy competition 
is kept up throughout the course. 

In the grade subjects tested by standard tests (Table IV) there 
are two cases, spelling and arithmetic reasoning, in which accu- 
racy is counted, that show no change in variability through the 
four years, but the other three show convergence. There is most 
clear and unambiguous convergence in the case of the Mixed 
Fundamentals test in arithmetic. The results of such tests running 
through years must be interpreted in the light of another fact: 
that is, that in intelligence-test scores there is probably a rather 
marked convergence with increasing age in childhood. This tend- 
ency is noticeable in the data from Myer’s Measuring Minds, 1921, 
page 27, from which we have calculated means, standard devia- 
tions, and coefficients of variability : 


Age in Years...... 6 7 8 9 10 11 12 13 14 
Number of cases...] 516 1191 1237 1440 1304 1318 1169 1204 928 
Means® iirc cccisie tates 11.5 17.0 23.9 29.6 34.8 38.7 44.5 47.8 50.8 
OF nisletercueapisicsys 8.1 9.7 11.6 1251 13.4 13.2 14.4 15.0 14.9 
LY be ORR AC ES Nc Cr .70 57 .49 .41 .39 84 .32 .3l 92 


*Correlation between the standard deviations and means is .97. 


If the Myers test, which is the same for all ages, is to be relied 
upon, we must make allowance in any performance tests for this 
reduction in variability with the advance of age in growing years, 
and it is then possible that the increasing likeness in the less marked 
eases of convergence in Table IV is wholly due to this fact of con- 
vergence in mental ability independently of the effects of training, 
or even in spite of it. Myers tests, however, may not present 
enough room at the top of the scale to give opportunity for the 
older children to represent their abilities as well as the younger 
children. 
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In the case of the psychology students, all adults, there is a 
clear convergence in variability of scores with progress through 
the twelve weeks’ course; that is, the students under training were 
becoming more alike in their abilities in psychology. Even though 
the variabilities which we calculated from the first and the ninth 
decile show this same tendency (we have omitted these data as they 
add nothing to those given), we cannot conclude that a few extreme 
individuals do not show an opposite tendency. It will be noted 
that the variability coefficients of the average scores in psychology 
are about equal to those in Army Alpha, but that the Otis Test 
scores have very much less variability than these. This may be 
due to the greater homogeneity in method or to possible differences 
in materials employed in the test, in appropriateness of time 
allowed, in degree of equality of units employed, or what not. At 
any rate, the difference in variability coefficients is not only marked, 
but consistent, so far as our data go; and it points plainly to a 
possible danger in using extraneous tests of any kind, even con- 
ventional examinations (for there is considerable variability in the 
coefficients of the several psychology examinations), to measure 
effects of practice on individual differences. Jf the successive tests 
do not give similar variability coefficients, they may lead to serious 
misinterpretations, and if they do exhibit gradual changes one way 
or another we cannot be sure this is not due to the nature of the 
extraneous tests, rather than to actual change in the variability 
of the subjects in the function studied. The psychology tests did 
not, of course, cover all the content of the course, neither were they 
uniform in difficulty as the medians and the correlations between 
medians and quartiles show. It is impossible, without careful 
standardization, to make examination questions equal. Moreover, 
tests of various kinds are apt to bunch or distribute subjects differ- 
ently according to different degree of ability, either within certain 
ranges only or within the total range. These shortcomings, varying 
in different examinations and standardized tests, produce a fluc- 
tuation in the coefficients of variability at different stages that does 
not in any true sense reflect changes produced by practice only.™ 


4 The low, and even negative, correlations between the quartile deviations 
and the medians in the psychology examinations show clearly the inadequacy 
of such examinations as determiners of changes of variability in learning. Be- 
sides being what we have called ‘‘extraneous’’ measures, these examinations 
have another defect. In the first part of the course they were technical and 
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They must therefore be scrupulously avoided by means of adequate 
experimental control, rather than by compensatory elaborations in 
statistical methods of handling the data, if investigators are to 
agree on the meaning of the results. Direct measurement of the 
performances under study themselves, at different stages or con- 
tinuously, is to be preferred to the use of what we have called 
““extraneous’’ tests, however well these latter are standardized. 


COMMENTS AND CONCLUSIONS 


Our data have disappointed us by yielding no clean-cut results. 
Indications as to improvement of technique are, however, prefer- 
able to overhasty generalizations. Research suffers more from pre- 
mature conclusions than from indefiniteness in results and conse- 
quent suspension of final judgments. Our results do serve to em- 
phasize the necessity, in future researches along this line, of ob- 
taining scores by the several subjects at the different stages of the 
practice in terms of the whole unit of work actually done, not by 
mere samplings by means of extraneous tests of any kind. On the 
basis of this caution we have grave doubts concerning the validity 
of such conclusions as Kelley’s,”? for instance, that nurture has 
reduced native differences in computation functions more than it 
has the native differences in such functions as are involved in his- 
tory and language information. 

We began investigating our present problem a few years ago 
with the conviction (resulting from certain preliminary data at 
hand) that subjects of normal heterogeneity would become more 
alike with practice on the simpler processes or activities, but more 


quantitative, while in the latter part of the course they gradually changed 
toward the qualitative type, calling for criticisms, evaluations, explanations, 
etc. Thus, in the later examinations discrimination between very good (or very 
poor) answers and those at the central tendency was more difficult. This re- 
sults, as numerous studies have shown, in two things: the raising of the general 
average score, on the one hand, and a lessening of differences, on the other. 
The decrease in the variability coefficient may be only a reflection of these 
changing aspects of the examinations. The low correlation between quartiles 
and medians is easily accounted for in the lack of uniformity in the successive 
examinations with respect to their quantitative aspects. Several examples 
studied show that, as a rule, a hard and highly quantitative examination 
brought the average down considerably and at the same time increased the 
quartile deviation. 

4 Kelley, T. L., The Influence of Nurture upon Natwe Differences, 1926, 
23 ff. 
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different on the more complex activities. We naturally sup- 
posed, then, that if all vocational activities were arranged in a 
series from the simplest to the most complex, there would be a 
point in the series for a given individual, with given abilities, 
below which we could advise him to choose his line of vocational 
activities if he would hold his own in competition with others. 
Industrious practice in an activity below this point would be re- 
warded by his becoming more and more like the. most practiced. 
With certain modifications, particularly with respect to special 
aptitudes, we still hold this opinion and hence regard the whole 
matter as of fundamental importance to education. But our own 
results and those of other investigators have surprised us, and, 
let us confess it, disappointed us. We must question the correct- 
ness of our over-simple original hypothesis. The problem is more 
complex, and possibly also more important, than we had antici- 
pated. There is a probability that our supposed point in the scale 
separating the complex from the simple processes for the repre- 
sentative individual was wrongly estimated, was placed too low— 
a probability that most of the processes we regarded as ‘‘complex’’ 
(in the sense mentioned) are really ‘‘simple’’ (in the sense that 
normal individuals converge with practice on them). In addition 
to this, it is possible also that a greater number of learning 
processes can be carried on simultaneously by subjects of high, 
than of low ability, even in cases where practice on single activi- 
ties results in convergence; or, to put it differently, that even on 
a group of ‘‘simple’’ activities practiced together, normal persons 
will diverge with practice. General observation supports this view. 

It is interesting to note that physical activities, like ball-tossing 
and javelin-throwing, show divergence with practice. Is this be- 
cause of innate peculiarities or because in these activities less defi- 
nite aid can be given the several subjects in their special difficulties, 
so that each must work out his own salvation? Is it possible, more- 
over, that one of the main differences in efficiency is the quality of 
keeping up vigorous activity without constant stimulation? Are 
the poor learners, even though they may gain on their superior 
competitors with proper motivation, as a rule inclined to relax 
effort and to allow themselves to drift into an easy-going pace 
(whatever that may mean in neurological terminology)? But why 
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have subjects under practice generally become more alike in mental 
multiplication and more unlike in addition? Is this because the 
latter is more complex (an incredible assumption) or because the 
former is so complex as to demand of the individual so much in- 
genuity at first as to produce high variability coefficients for later 
comparisons? It is interesting to note that the coefficients where 
the subjects diverge with practice are usually large, but there are 
some conspicuous exceptions. These and many other questions we 
leave to further research in this interesting field. 

In addition to the need of certain precautions already stressed, 
we desire to emphasize here, for future investigations of the 
effects of practice, the necessity of studying carefully the correla- 
tions between the factors represented in the numerator and the 
denominator of the coefficient of variability. It is desirable to in- 
vestigate experimentally the conditions which bring about low or 
negative correlations, and especially to remove such conditions by 
improved experimental control in as far as this is possible rather 
than by compensatory statistical calculations. A special study 
needs to be made of the relationships between the 7 (or D) and 
the mean (or the median) of time-for-unit-of-work data in the 
early stages of learning when the time curves are dropping rapidly. 
Data of the sort needed for the analyses here suggested were in 
most cases not available in the present work. This deficiency was 
due in part to our own ignorance as to the need of certain kinds of 
data at the time the work was begun. Alas, the finer elements of 
technique in any important piece of work come only with practice 
in that work, not before it is begun! 


PREFATORY NOTE TO CHAPTER XV 


This chapter presents a brief survey of the literature (or of 
some portions of it) that deals with the factors underlying the 
appearance of one faily well-defined specific trait—musieal ability. 
Dr. Farnsworth concludes that the question is ‘‘still a moot one.”’ 

The main lines of attack upon this problem have been (1) to 
investigate family resemblance in musicality, and (2) to investi- 
‘gate the so-called ‘elemental character’ of musical abilities. 

The results from the first approach, while fraught with sug- 
gestive implications, have the same inherent limitations that are 
contained in a similar approach to the heredity of intelligence or 
of any other mental trait. Nature and nurture simply cannot be 
extricated from each other by ordinary means of investigation. 

The results from the second approach, as the author has pointed 
out, are also inconclusive, partly because of ambiguity in the minds 
of investigators as to what actually constitutes training that can 
test the elemental character of musical abilities. 
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CHAPTER XV 


THE EFFECTS OF NATURE AND NURTURE 
ON MUSICALITY 


PauL BR. FARNSWORTH 
Stanford University, Palo Alto, California 


There are at least two conflicting views held at the present time 
concerning the relative effects of nature and nurture on musicality. 
One, with Watson (38) as its chief exponent, holds that nurture 
(environment) is all important. ‘‘Our conclusion, then, is that we 
have no real evidence of the inheritance of traits... . . I should 
like to go one step further now and say, ‘Give me a dozen healthy 
infants, well-formed, and my own specified world to bring them 
up in and I’ll guarantee to take any one at random and train him 
to become any type of specialist I might select—doctor, lawyer, 
artist, merchant-chief and, yes, even beggar-man and thief, regard- 
less of his talents, penchants, tendencies, abilities, vocations, and 
race of his ancestors.’ ’’ 

Certain other psychologists, while doubtless willing to give 
nurture a place as one of the causes of musicality, believe that 
nature is more potent. In pursuance of this view they have devel- 
oped ‘capacity’ tests with which to select children blessed by 
nature with superior talents. 

C. E. Seashore in his The Psychology of Musical Talent (27) 
and in numerous articles states again and again his belief in the 
heredity of musicality. To quote one passage: ‘‘Musical talent 
is a gift bestowed very unequally upon individuals. Not only is 
the gift of music itself inborn, but it is inborn in specifie types. 
These types can be detected early in life, before time for beginning 
serious musical education. This fact presents an opportunity and 
places a great responsibility for the systematic inventory of the 
presence or absence of musical talent.’’ 

In this chapter I shall present a fairly comprehensive survey 
of the available data relating to this problem. The material will 
be grouped for the sake of convenience into a number of divisions. 
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I. Genetics 


Hazel M. Stanton (31) aided by an appropriation granted by 
the Carnegie Institution of Washington, studied the abilities of 
eighty-five parents and offspring on four of the Seashore phono- 
graph tests—pitch, intensity, time, and tonal memory. ‘‘These 
measures were supplemented by a systematic interrogation which 
covered questions in regard to musical environment, musical edu- 
cation and training, musical activity, musical appreciation, musical 
memory and imagination.’’ Her results are stated as follows: 
‘‘The harmony of the results with certain Mendelian laws in the 
family distribution tables of assumed gametic formulae is not 
improbable. ’’ 

J. A. Mjoen (21), a firm believer in the biological inheritance 
of musicality, has worked through many family lines. Among his 
reports is the rather interesting case of an extremely musical girl 
in an exceptionally unmusical family. Her case was a mystery un- 
til it was disclosed that she was the illegitimate daughter of a very 
prominent musician. 

Valentine Haecker and Th. Ziehen (14), in two extensive sur- 
veys, found by the questionnaire method that the numerous musi- 
cal factors which they analyzed apparently followed Mendelian 
principles. This work was later extended by Koch and Mjéen (19) 
with similar results. 

Galton (10) believed in the inheritance of musicality, but was 
handicapped in the investigation of the distribution of musical 
taste in families because of the numerous and petty jealousies 
among musicians. Nowhere could he get even a relatively unbiased 
rating of musicians’ abilities. That standards of musical taste are 
even yet quite relative is well shown by Esther A. Gaw’s (11) sur- 
vey of the reports of New York City musie eritics on identical 
concerts. 

According to C. C. Hurst (17) ‘‘That this natural disposition 
or temperament [for music] is innate and hereditary, there can be 
little doubt. Musical associations and careful training can, of course, 
do much in enriching the musical qualities in an individual of a 
musical disposition, but in the absence of the musical temperament 
these outer stimuli are practically powerless.’? For Hurst, musi- 
cality is probably a recessive trait. 
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William Wallace (37) actively combats the conclusions of Hurst 
and shows that it is at least as reasonable to postulate nurture as 
the cause of persistent musicality in family lines as nature. He 
attributes musicality to the ‘‘continuity of vocation.’’ 

A similar view is taken by Mrs. E. F. Copp (5). ‘‘I think it 
[musicality] is a matter of heredity, but that almost everyone pos- 
sesses the heredity.’’ She finds that positive [absolute] pitch can 
be acquired by 80 percent of normal children. There is a differ- 
ence, however, in the speed of learning. Individuals lose the ‘in- 
born’ talent through lack of training. She points out that many 
musical people, like Haydn and Schubert, came from unmusical 
families. The paucity of musical women is regarded as evidence 
against the argument of musical inheritance. 

According to Vaerting (34), lack of creative musical talent in 
women is correlated with the primitive sexual function of vocal 
music. The male is the natural musical performer. 

Both Hans Rupp (24) and Davenport (7) regard musical tal- 
ent as inherited. Drinkwater (9) decides that the ‘musical sense’ 
is probably recessive. 

Stern (32) and Bernfeld (3), however, affirm that musicality 
may be acquired. The latter states that by the exertion of the will 
the unmusical may become musical. He cites a number of cases in 
support of his contention. 

Some work of F. O. Smith (30) may be construed at first glance 
as favoring the environmentalists. Tests of pitch discrimination 
were given two large groups of siblings. One group gave a ccr- 
relation of .48, P. E. .031; the other .43, P. BE. .035. In each case 
a younger child was paired with an older sib. When the younger 
children of group two were paired with other children of the same 
age and sex as their sibs, but who were not related to them, the 
correlation was .53, P. HE. .030. This surprising result may pos- 
sibly be explained in part by probable errors in selection, It is 
quite likely that, on the average, the younger the child, the younger 
the sib with whom he was paired. Since age is an important factor 
in test scores of this type (see the Seashore test age-norms), this 
would in itself give spurious positive correlations. The reliabil- 
ity of his test is also unknown. 
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Voss (36) cites the case of a musical man who had two wives, 
one musical and one unmusical. There were seventeen children 
involved in the five succeeding generations. As in the well-known 
Kallikak study of Goddard, the effects of social heredity were not 
separated from biological inheritance; so the scientific value of this 
study is questionable. 


Il. ELEMENTAL CHARACTER BY TEST 


The difficult question of nature and nurture may be examined 
from the test standpoint, 7.e., we may ask when does a test disclose 
an elemental character? According to Seashore and Mount (28) 
‘‘to the extent that the test is elemental, there should be no ap- 
preciable improvement with practice, development with age or 
variation with general intelligence.’’ 


1. General Intelligence 


Of the three criteria, that of general intelligence is perhaps 
the easiest to meet. If the Stanford Binet be arbitrarily desig- 
nated as its measure, many tests have been developed which have 
a low correlation with it. 

The work of Mrs. Leta Hollingworth (15) has shown that gifted 
children (those with high intelligence quotients), as a group, do 
no better on the Seashore ‘capacity’ tests (excepting possibly the 
test of time discrimination) than do children of average intelli- 
gence. While one should hesitate to conclude that children of high 
intelligence are no more likely to be musical than children of aver- 
age intelligence (Dr. Hollingworth does not make this statement), 
this investigation at least suggests that conclusion. 

In his Psychology of Musical Talent, C. E. Seashore reports 
that not one of his battery of tests correlates to any great extent 
with estimated intelligence. F. O. Smith (30), however, finds cor- 
relations of .70, P.E. .023, between pitch discrimination and general 
intelligence in the case of boys, and of .63, P. E. .026, in the case 
of girls. ‘General intelligence’ was here ascertained by teachers’ 
ratings of reliability and brightness (quickness and accuracy of 
mental grasp—general wide-awakeness). As teachers’ ratings are 
notoriously unreliable, too much stress should not be laid on this 
work. 
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Weaver (39) from a study of 94 subjects, found the following 
correlations between scores on the Seashore battery of music tests 
and the Army Alpha: pitch .35, intensity .24, time .12, consonance 
06, and tonal memory .26. 

The writer has recently correlated scores on the Seashore and 
Kwalwasser tests with Thorndike intelligence seores. For approxi- 
mately 150 university-students the results were: pitch, .14; 
rhythm, .17; memory, .11; consonance —.38 (an elimination of 
four extreme cases changes this correlation tc +.02); time .10; 
intensity, .11; melody —.08; and harmony .22. 

Serial reaction time seems to have musical significance. It 
may be simply defined as a speed test in which each response be- 
comes the stimulus for the next response in an extended chain of 
stimuli and responses. C. F. Hansen (13) reports three groups 
of correlations with this test and the Army Alpha Intelligence Ex- 
amination. Forty-one ‘musical’ college students furnished a cor- 
relation of .51, P. E. .08; sixty-one high-school stenographic and 
teachers’ college students, a correlation of .20, P. E. .08; and 174 
high-school stenographie students, one of .14, P.E. .05. At first 
glance, one might assume that the largest group gave the most 
reliable data, and so state that the Army Alpha Test had little 
correlation with serial reaction time. Another hypothesis might be 
that the Alpha was a content test for the stenographers, but a speed 
test for the more advanced students. 

This latter hypothesis is somewhat substantiated by work by 
R. Seashore, M. Tinker, and the writer. With groups of univer- 
sity students ranging in number from 28 to 35, scores on serial 
action correlated with Part 5 of the Ohio State University Psycho- 
logical Test, Form 10, .14; with Thorndike, .20; with Otis Group 
Intelligence Scale, Advanced Examination, Form A, .30; and with 
the Army Alpha, Form 8, .53. 

In some unpublished work, R. Seashore finds a correlation of 
— .21, P. E. .11, between simple auditory reaction time and Thorn- 
dike intelligence scores. From a study of 47 men at the Univer- 
sity of Iowa the same worker (29) reports a correlation of .10, P. E. 
.09, between percentile rank in the University of Iowa Qualify- 
ing Examination and rank in ‘sense of rhythm’ (Seashore test). 

Schiissler (25) some years ago started a controversy with 
Bartsch on the relation between scientific and musical abilities. Ac- 
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cording to the former’s study of 200 school children, the musical 
are considerably superior to the non-musieal in scientific ability. 
Bartsch (2), from a study of 200 normal-school students who had 
received six years of musical training, reports that the more musi- 
eal have no greater scientific ability than the less musical. Schiiss- 
ler (26) answers Bartsch in a later article. 

The study of Cox (6) on the youthful achievement of men 
deemed great in later life includes a number of musicians. Révész 
(22) and numerous others write of musical prodigies who seem to 
have high scholastic intelligence. On the other hand, there are 
numerous reports of remarkable musical ability accompanied by 
inferior scholastic intelligence. For example, S. J. Holmes (16) 
claims that ‘‘Blind Tom, who possessed a phenomenal aptitude for 
playing any piece of music he may have heard, was practically an 
imbecile. ”’ 

2. Age 


The age factor is not easy to control. Completely to rule it out, 
would mean the construction of a test which gave the same aver- 
age score in large groups regardless of the ages of the group mem- 
bers. Even Seashore, in his well standardized tests, has three dis- 
tinct norms,—for fifth-grade pupils, eighth-grade pupils, and adults 
(see Table I). He justifies this by the introduction of two con- 
cepts,—the physiological and the cognitive limits. All tests aim 
to disclose the absolute, or physiological, limits in an ability. They 
fail to the extent that they become monotonous and uninteresting. 
What they test then is the cognitive limit. Thus, there is more 
chance of tapping the adults’ physiological limits than those of 
eighth-grade pupils, and more likelihood of obtaining the physio- 
logical limits of eighth than of fifth-grade pupils. 


Tasie I.—TypicaL PERCENTILES IN THE SIx SEASHORE Trsts FoR FIFTH- 
Grave Puriis, Erauta-Grape Pupiis, AND ADULTS 


Pitch Intensity Time Rhythm |Consonance| Memory 
Percent Re- 


sponses Correct 
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The statement by I". O. Smith (30) that ‘‘the amount in favor 
of the adult shown in all group statistics is amply accounted for by 
the difficulty in making a reliable test on the young and by their 
lack of information’’ seems hard to reconcile with the fact that in 
intelligence tests the average mental age of the eighth-grade pupil 
is found to be almost, if not quite, that of an adult. 

The work of Hollingworth, reported earlier in this chapter, also 
tells against Smith’s assumption. Forty-nine children whose I.Q.’s 
ranged from 135 to 190, with a median at 153 (Stanford Binet), 
gave about the same means on five of the Seashore phonograph 
tests as do normal children. And this occurred in spite of the fact 
that ‘‘their median intellectual ability was, therefore, at least that 
of the average adult.’’ 

In view of the norms established by Seashore for his rhythm 
test, this statement by DeGraff (8) is surprising: ‘‘It is usually 
believed that age influences rhythmic discrimination; that the older 
the person is, the better his rhythmic discrimination will be, but 
such is not the case, as a glance at the distribution table [his own] 
will show the distribution curve of the three age groups, namely, 
fifth-grade, eighth-grade, and adult, reveals an unusual degree of 
similarity. Although the adult has the advantage of maturity, 
enabling him to comprehend and concentrate on the test, his score 
on the rhythm test is only slightly higher than that of the fifth- 
and eighth-grade school child.’’ 


3. Training 


Among the earliest experiments on training of pitch discrimina- 
tion were those of H. S. Buffum (27). He ‘‘classified 28 eighth- 
grade pupils in a preliminary fifteen-minute individual test for 
each one and then proceeded to train them by requiring 40 minutes 
of the most specific and intensive practice under favorable condi- 
tions for 20 successive days, recording the average achievement for 
each day. The result was greatly surprising. Instead of showing 
progressive improvement with practice, each child remained as 
rated in the preliminary test, with only two exceptions, and the 
' average ability for the class was the same at the end of the 20 days 
of this intensive practice as at the beginning.”’ 
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F. O. Smith (30) has done a somewhat similar piece of work 
with 476 children, which corroborates Buffum. He concludes: 
‘The sensitiveness of the ear to pitch difference cannot be improved 
appreciably by practice. There is no evidence of any improvement 
in sensitiveness to pitch as a result of practice.’”* 

F. B. Ross (23) found that the discrimination of time as meas- 
ured in the Seashore test (old form, not the Columbia record) dis- 
played a marked improvement in eight subjects when the test was 
repeated seven times on successive days. 

In some as yet unpublished studies on musical college students, 
F. A. Wright found that the giving of the Seashore battery of tests 
daily for a week increased the average scores slightly, but not to 
any significant extent. In a number of cases the highest score was 
made at the middle of the week. 

It has been my experience that many subjects with no formal 
musical training make higher scores in distinguishing major, minor, 
and mistuned intervals than do subjects with considerable train- 
ing. According to R. Seashore (29), this is also true with motor 
rhythm. Klauer (18) put subjects through two months of in- 
tensive marching, clapping, beating time, ete., and failed to find 
a significant increase in scores on sensory rhythm discrimination 
over the scores made by untrained control subjects. However, her 
results are complicated in that her control group was retested in a 
much shorter. time than the training group and was obviously 
atypical. That is, its mean percentile was considerably lower than 
her training group’s, so that no accurate comparisons can be drawn. 
R. Seashore (29) reports a correlation of .82, P.E. .02, between 


+The interpretation of results in such mass experiments as those of Buffum 
and Smith is, in my judgment, decidedly difficult, if not often misleading. 
Certainly, an equally important method of studying the effects of practice is 
to confine one’s effort to drilling a competent, though unmusical, adult under 
laboratory conditions which permit some measure of qualitative analysis of 
what takes place. If this is done, as I myself showed years ago in a study of 
pitch discrimination (Amer. Jour. of Psych., 14: 1903, 561-568) not listed in 
the bibliography here, the effect of systematic training of an unmusical person 
is a curiously complex one—there appears marked improvement under certain 
conditions, along with none at all under others. Without going into all the 
details, suffice to say that I was able to train my subject to discriminate cor- 
rectly pitch differences of less than three vibrations (a perfectly laudable 
achievement), but could at will alter conditions slightly and cause her to fail 
utterly to discriminate pitch differences, not of a few vibrations, but of more 
than two octaves. What is one to conclude from that? Can the unmusical be 
trained or can they not?—Lditor, 
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scores on the sense of rhythm test and learning rhythm. The sub- 
jects in this experiment comprised those who stood in the highest 
and the lowest 10 per cent on the sensory rhythm test. 

Seashore and Mount (28) report correlations between musical, 
instrumental or vocal training (as judged by questionnaires) 
and musical capacities (tested by the Seashore tests) which range 
from .39 down to .03. Correlations between musical environment 
(as judged by questionnaires) and capacities ranged from .30 
to —.03. 

In an interesting study by F. M. Brennan (4) training was 
found to correlate significantly (.40 or more) with pitch and 
rhythm discrimination, tonal memory, imagery, simple reaction, 
and intensity control. The training score was the actual number 
of half-hour lessons the student had had. It is well to bear in mind 
that causality is not necessarily expressed in such a correlation 
relationship. As Miss Brennan says, ‘‘In other words, his tonal 
memory need not be good because he had received a certain amount 
of training, but because he is possessed of a keen tonal memory, 
he naturally gravitates toward thinking and working with tones.’’ 

Using a similar training score criterion, De Graff (8) found 
correlations between it and rhythm discrimination of .09 with 464 
adults, of .10 with 282 eighth-grade pupils, and of .21 with 272 
fifth-grade pupils. 

According to the work of Vance (85), pitch discrimination is 
keenest at 128, 256, and 512 d.v.—just the octaves most used in 
music. In fact, the musically trained subjects had better ‘musical 
ears’ at these pitches than the non-musical, but were poorer than 
they were at higher and lower pitches. 

Under the heading of absolute pitch are listed such different 
phenomena as the ability to name a note played on one’s own piano 
or violin, sung by a particular voice, played by almost any piano 
or violin or sung by another’s voice, played with relatively pure 
tones, memory of a certain tonal frequency, ete. It is quite obvious 
from the many studies of this subject that absolute pitch can to 
some degree be acquired. In fact, it is taught by certain teachers, 
although other instructors consider it a doubtful aid to musical 
achievement and so do not encourage its development. It seems 
equally certain that some children display a marked degree of this 
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ability without having had any formal training. This fact has led 
certain writers to consider the ability as inherited—a belief which 
has little to substantiate it, but which cannot be proved faulty until 
more careful analyses of the early musical environments of these 
children are attempted (Gough, Ref. 12). 

Prior to the work of P. F. Swindle (33) and M. F. Meyer (20), 
musical and psychological theorists held pretty generally that 
rhythm was an instinct. It is present to some degree in all races 
of men and thus meets the criterion of universality. If it is 
inherited, however, how does it happen that one finds two-point 
(two-step), three-point (waltz), four- and six-point rhythms in 
human activity, but no five- or seven-point rhythms? (There are 
some exceptions.) Swindle and Meyer both found it possible to 
train subjects so that they eventually ‘felt’ these unusual rhythms 
as intensely as the commoner four and six. They did this by fore- 
ing the subjects to beat out these rhythms day after day. Swindle 
concludes: ‘‘ All experimental evidence seems to point to the con- 
clusion that rhythm is acquired by each individual, and that it is 
not inherited. Biological conditions—for example, the anatomical 
fact that we are two-footed, two-handed, and generally two-sided, 
not three-cornered or starfish-like beings—are favorable for the de- 
velopment of those rhythms which have usually been considered to 
be instinetive, while the other rhythms can be acquired only under 
special, somewhat artificial conditions.’’ 

C. I. Seashore and others still hold that rhythm is an instinct, 
and that the innate capacity in this function can be measured. It 
seems to me that there is a way to reconcile partially these con- 
flicting views. Might it not be that, although the specific rhythms 
one possesses must be acquired, one individual may profit more 
and faster from practice in rhythms than another? These indi- 
vidual differences may be due to nature or nurture—no one knows 
which—but they have musical significance. Rhythm tests would 
then be achievement tests. 

Seashore (27) states that training undoubtedly favors improve- 
ment in the discrimination of consonance, but that the improvement 
is not so great as is commonly supposed. Strangely enough, he 
maintains that imagery is a special gift and that any increase in ~ 
this ability is apparent, but not real. As the analysis of imagery 
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must be introspective, and so personal, I see no way of lifting this 
question from the realm of pure opinion. As to the effect of 
training on tonal memory, Seashore declares that ‘‘those with a 
large capital will improve, not only in proportion to their capital, 
but probably in increasing proportion for the larger capital.’’ 
Seashore summarized the effect of training on pitch as follows: 


At this preliminary stage, the following general conclusions 
may be stated: 

(1) When the proximate physiological threshold has been 
reached, practice is of no avail. (2) So long as a cognitive 
threshold prevails there is prospect of improvement by prac- 
tice to the extent that the cognitive is above the physiological 
limit. (3) This improvement is usually very rapid, often 
immediate and can usually be traced to the acquisition of 
knowledge, through experience or information about what 
pitch is, as distinguished from other attributes of tone. (4) In 
the majority of cases it is possible for the ingenious experi- 
menter to discover the proximate physiological threshold to a 
fair degree of certainty in a well-planned half-hour individual 
test, or in one heterogeneous and one homogeneous group test 
of one hour each; and, for most of the cases in which this fails, 
the same tests demonstrate positively that the threshold is only 
cognitive. (5) The possibility of reaching the physiological 
limit in a single test depends to but very slight extent upon 
whether or not the person tested has had musical education; 
it is mainly a matter of expert skill and ingenuity on the part 
of the experimenter. (6) Ordinarily musical education is not 
effective as a means of improving pitch discrimination. 


The same author cites three lines of evidence to show that in- 
tensity discrimination is elemental: 


Evidence: extended training. Fourteen observers were 
selected regardless of initial ability and subjected to a pro- 
longed series of specific and intensive practice of from four- 
teen to thirty periods of one half hour each, and a record was 
kept of performance from day to day. The general conclusion 
from this experiment was that systematic practice under the 
most favorable conditions did not result in a change of the 
record. On the whole, the achievement of these fourteen ob- 
servers was practically the same at the end of the practice 
periods as at the beginning. 

Evidence: the case of the blind. It is generally supposed 
that, since the blind guide themselves by hearing so exten- 
sively, they must have more highly developed sensitiveness to 
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sound. To test this we took fifteen students in the School for 
the Blind who had been blind from early infancy and were 
normal in other respects, and had acquired skill in various 
kinds of work and ability to orientate themselves without 
sight. We compared the achievements of these with the 
achievements of fifteen seeing high-school pupils selected under 
similar conditions, and the result of this, the most careful test 
of its kind, was to show that the average for fifteen blind and 
fifteen seeing persons is approximately the same. This means 
that the intensified training which the blind have had since 
childhood has not tended on the whole to make them keener in 
this specific sensory capacity. 

We found the same true of the hearing of direction, the 
sense of pitch, the sense of touch, and the sense of weight. 
This goes to show that the fundamental capacities of the senses 
are early developed to their maximum and that further de- 
velopment takes place not in the fundamental capacities but 
in the use of these capacities in more complex forms. 

Evidence: the case of musical training. To these striking 
testimonials to the elemental character of intensity discrimina- 
tion, we may add the records of children of the seventh and 
eighth grades who were classified on the basis of the amount 
of training, as above, and on the basis of intensity discrimina- 
tion. In this we find a trace of agreement in the form of a 
tendency for those who have had extensive musical training to 
rank higher than those who have not. But the tendency is 
slight and is easily accounted for on the theory that those who 
have a keen appreciation of sounds have gravitated toward the 
opportunities of musical training. 


A portion of the argument between the Seashore school and the 
musicians could be settled, I think, by distinguishing certain con- 
cepts carefully. Take the terms ‘capacity’ and ‘ability.’ As de- 
fined by Seashore (27), ‘‘The term ‘capacity’ has reference to the 
inborn or native power; the term ‘ability’ is used to designate 
acquired skill in the use of a capacity.’’ It should be obvious, then, 
that the usual aim of the tester is to measure capacity, but he 
should hesitate to label his test scores measures of ‘capacity,’ as he 
rarely knows the exact extent to which environment has functioned. 
When he states that one’s musical ear cannot be made keener, he 
is usually referring to the capacities in pitch, intensity, ete., and 
he may or may not be correct. The musician, however, well knows 
that, after training, a student can more easily tell when he is on 
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the key. This may be merely a shift from cognitive to physiological 
limit, or it may mean the entry of some other factor. 

Again, take the term ‘training.’ (1) Training may mean 
merely the giving of a test a number of times, as in the studies of 
Wright, Ross (23) and others. (2) Training may mean repeated 
testing plus informing the subject which of his responses are 
correct and which incorrect. (3) Training may include explana- 
tions of the concepts involved, cues as to procedure, and the like. 
(4) Training may mean simply the accidental stimulation of the 
organism over long periods of time, with no formal instruction. Or 
(5) it may mean the formal training of an individual for a con- 
siderable period. Data on the effects of training will have different 
meanings as the meaning of training varies. Investigators should 
be specific in their descriptions of the training factor. 

Certain of the psychoanalytic school hold that the musically 
talented are those who are expressing part of their libido in musi- 
eal activities. Adler (1) points out that both Beethoven and 
Mozart had deformed ears. ‘‘I should only like to emphasize that 
to me the impression of the connection between a beautiful singing 
voice and organ inferiority which I have outlined is a definite one, 
and that I have considerable material to support it.’’ 


CONCLUSION 


From the data considered in this paper, one is practically 
forced to conclude that the question of musical heredity is still a 
moot one. While genetics can illuminate the issue to some extent, 
I doubt that its contributions can ever be conclusive. With human 
stocks it is difficult to make very reliable Mendelian counts. And, 
even where musicality can be demonstrated as prevalent in certain 
family lines for generations, how can the factor of social inheri- 
tance be completely separated from that of biological inheritance? 

Along the lines suggested by Seashore in his criterion of 
‘elemental’ tests, the prospects for worthwhile work seem much 
brighter, and at least a beginning has been made. 
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PuRPOSE AND NATURE OF THE SUMMARY 


In compiling the bibliography for this summary, and in classify- 
ing the summarized material, the general arrangement of the Year- 
book has been followed as closely as possible. I have attempted to 
include at least brief mention of all material to date (June, 1927) 
which bears on the same themes as the original studies published 
in this Yearbook, i.e., material which has offered crucial evidence 
in favor of or against either nature or nurture as determiners of 
mental capacity or of school achievement. Studies which seem to 
me to have made the most important contributions have been 
starred. 

In addition to this clear-cut experimental material, much other 
material has been summarized because of the tendencies it reveals, 
the questions it provokes, the historical importance it holds, or the 
significance it contains as factual background for future research. 
To illustrate, nearly all the data summarized in the section on 
racial and nationality differences; and most of the data on intelli- 
gence in relation to differences in locality, parental status or home 
environment; on family resemblance; and on constancy of intelli- 
gence, fall into this type, as well as a number of other studies under 
practically every rubric of the entire summary. The number of 
such ancillary studies here assembled exceeds the number of the 
crucial type; even so, no pretence is made of covering the litera- 
ture on such topics exhaustively. If this had been done, the Year- 
book would have had room for nothing else. The notes upon some 
of the chief studies of this type (which are rather briefer than the 
notes upon studies of the first type) will at least give the reader 
an orientation in a variety of closely allied fields. 

The summaries themselves are merely condensed, non-critical 
abstracts of the original sources. However, a number of them 
contain material which cannot be adequately interpreted out of its 
original context. Because I chance to have had occasion to read 
these contributions with care, I have taken the liberty sometimes 
of inserting comments which indicate the limitations or points of 
special significance not immediately obvious. These comments are 
always labeled and separated from the summaries proper; they 
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favor one side of the nature-nurture question fully as often as they 
favor the other. 

References have been classified according to the field of evidence 
with which they are concerned. Within the separate classes the 
reader may also find it interesting to note other more or less dis- 
tinet lines of cleavage which correspond roughly to the chrono- 
logical order of the contributions. 


A. Factual Material Leading Towards Hypotheses 


1. Qualitative studies. These point out merely the existence of 
phenomena, often in individual cases, which require explanation. 
Large-scale tabulations of the incidence of genius and of feeble- 
mindedness in families or in groups of defined occupational, cul- 
tural, or racial status are included in this group. 

2. Quantitative studies. Material of this type provides the 
groundwork for analytical studies that aim to separate and evaluate 
nature-nurture influences, though it contributes nothing crucial in 


itself, as for example, studies of family resemblance and constancy 
of the I.Q. 


B. Evaluation of Nature-Nurture Effects 


1. Presence-or-absence studies. These reveal evidences of 
hereditary or of environmental forces, but fail to answer the ques- 
tion ‘‘How much?’’ Studies of differential resemblance of sib- 
lings, fraternal twins, and identical twins fall in this class, as do 
studies of increments in intelligence-accomplishment correlations 
through intensive pushing of accomplishment. 

2. Quantitative studies. These attempt to establish either 

(a) The proportional contributions of nature or nurture to 
deviations in ability under defined conditions, or 

(b) The quantitative effects (in terms of measurable ability 
units) of defined nature or nurture influences. 

There are very few studies of either type (a) or (b) to be 
found. | 

I acknowledge gratefully the kindness of Dr. Rudolph Pintner and 
of Dr. Raymond R. Willoughby who put at my disposal unpublished bib- 
liographies dealing with certain sections of this summary. These bib- 
liographies were of substantial help; but for whatever shortcomings 
this summary may have I alone am responsible, since many phases of 
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the problem treated here were not included in the compilations of either 
Dr. Pintner or Dr. Willoughby. Dr. Pintner, in addition, permitted the 
writer to consult, and to incorporate in this summary wherever possible, 
material worked up by him upon thegeneral contributions of intelligence 
testing to the nature-nurture problem. This valuable material was of 
help in numerous ways in compiling the present chapter. 

For the reader who wishes to familiarize himself with the views 
of current investigators in the field, the following references dealing 
with theoretical discussions or critical summaries rather than with ex- 
perimental data, and hence not summarized in the body of this compila- 
tion, may be useful: 


Burt, C. “The inheritance of mental characters.” Eug. Rev. 4: 1912- 
1913, 168-200. 

Elderton, E. M. “A summary of the present position with regard to the 
inheritance of intelligence.” Biometrika, 14: 1923, 378-408. 

Freeman, F.N. Mental Tests. Cambridge, 1926, 503 pp. (Ch. 17, “In- 
terpretation of intelligence tests.” ) 

Gates, A. I. “The inheritance of mental traits.” Psych. Bull., 18: 1921, 
358-365. 

Holmes, S. J. The Trend of the Race. New York, 1921, 396 pp. (Ch. 5, 
“The inheritance of mental ability.”’) 

McComas, H. C. “The heredity of mental abilities.” Psych. Bull., 11: 
1914, 379-383. 

Peters, W. Die Vererbung geistiger Eigenschaften und die Psychische 
Konstitution. Jena, 1925, 368 pp. 

Pintner, R. Intelligence Testing. New York, 1923, 406 pp. (Ch. 20, 
“The inheritance of intelligence.’’) 

Poyer, G. Les Problémes Généraux de VHérédité Psychologique. Paris, 
1921, 302 pp. 

Spearman, C. “The heredity of abilities.” Hug. Rev. 6: 1914-1915, 
219-237. 

Starch, D. Educational Psychology. New York, 1919, 473 pp. (Ch. 6, 
“The inheritance of mental traits.” ) 

Thorndike, E. L. and others, The Measurement of Intelligence. New 
York, 1927, 616 pp. (Ch. 16, “The measurement of original intel- 
lectual capacity and of acquired intellectual ability.”) 


The reader’s attention is also called to the following volume, which 
contains a bibliography much more complete upon certain phases of the 
problem than the one presented in this summary: 


Holmes, S. J. A Bibliography of Eugenics, Berkeley, 1924, 514 pp. 
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I. INTELLIGENCE 
1. Family Resemblance 


a. Ordinary Family Relationships. The general topic of 
family resemblance has been given first place in this bibliography 
because it involves the fundamental human phenomena which in- 
stigated the nature-nurture problem itself, and because studies in 
the province of family resemblance promise the only possible 
ground-work for future investigations upon the hereditary 
mechanism of mental traits. With the present status of our 
knowledge, however, it would be conceded by almost every one 
that few family-resemblance studies have contributed anything — 
crucial to the general problem as yet, since resemblance conceivably 
might be due either to nature or to nurture or to both in various 
proportions. 

When going over the results of studies summarized here, the 
reader should take special note of the fact that none of the corre- 
lations reported in the literature has been corrected for attenua- 
tion. If the unreliability of tests and ratings had been allowed 
for by such corrections, the correlations would, of course, give 
truer values for the actual amount of resemblance present. 


1. Cobb, Margaret V., and Hollingworth, Leta S. ‘‘The regres- 
sion of siblings of children who test at or above 135 I.Q. (Stanford 
Binet).’’ Jour. Educ. Psych., 16: 1925, 1-7. 

Thirty-seven gifted pupils at P. S. 165, Manhattan, had an average 
I.Q. of 155 by the Stanford Binet. Thirty-four siblings of these pupils 
had an average I.Q. of 129. Thirty-three first cousins had an average 
I.Q. of 127, but the authors think these may be selected. Twenty par- 
ents had a median Alpha score of 142. 


2. Dexter, Emily S. ‘‘On family resemblance beyond the first 
degree of relation.’’ School and Soc. 19:1924, 501-502. 


See section on family resemblance in achievement for summary. 


3. Downey, June E. ‘‘Standardized tests and mental inheri- 
tanee.”’ Jour. of Heredity, 9: 1918, 311-14. 


The following is given with respect to tests of parents and children: 
“Certainly it is significant that of groups of parents and children re- 
cently tested by my pupils the combination of parents, both of whom 
made a record of superior mentality by the Stanford adult scale, gave 
80 percent of children making a superior or very superior record; while 
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when only one parent gave a superior record or both made only an 
average one, the percentage of superior children was only 33. Our 
groups were too small to permit our drawing any dogmatic conclusions.” 


Comment: ‘The number of the cases is not reported. 


4. Elderton, Ethel M. ‘‘On a measure of the resemblance of 
first cousins in man.’’ Eug. Lab. Memoirs, 4: 1907, 1-53. 


Three hundred pairs of cousins were rated for intelligence by vol- 
unteers. The ratings yielded a mean square contingency coefficient of .34. 


5. *Elderton, Ethel M. ‘‘A summary of the present position 
with regard to the inheritance of intelligence.’’ Biometrika, 14: 
1922, 378-408. 


Much of the early Galton laboratory work is reviewed. The fol- 
lowing discussions, however, apparently appear for the first time. 

(1) Kate Gordon’s second series of orphan sibling data reworked. 
Miss Gordon’s pairs contained a number of Jews, Mexicans, Italians, 
etc. Elderton thought that Gordon’s r of .61 for 216 pairs may have 
exceeded the r of .51 (by Pearson’s computation) on the first gI pairs 
because of greater variability, but on using Gordon’s original data, she 
found the variability no greater. 

Gordon had used no child more than once in pairing to obtain her 
r of .61. Elderton now paired the siblings in all possible ways, made 
the table symmetrical, and got .47 = .03. 

Finding an r of —.31 +.03 between age and I.Q., she corrected 
each child’s I.Q. for the 1.59 se so yeas and computed r by the same 
method as above, getting .54 += 

Partialling out C. A. in the: ea way, by Gordon’s method of 
pairing, older vs. younger, r was .58 + .03. 

Using all pairs, and C. A. constant, r was .54 + .o2. 

Using all pairs, C.A. constant and table symmetrical, r was 
53 + .02. 

Comment: ‘The ages and length of time in orphanages are not 
mentioned here. The possibility of early home influences keeps the data 
from being entirely crucial, but the study is none the less very signficant. 

(2) Dr. Drinkwater’s data from a high class and a low class school. 
The numbers of siblings tested by Stanford Binet (and by Binet-Simon 
in the case of some of the younger ones) are not noted, but the P.E.’s 
of r’s are reported. Results are given separately for the two schools, 
but just the combined results are summarized here. A negative r was 
found between intelligence and age in the case of girls but not in that 
of boys—an outcome that is attributed to selection. Ages and grades 
are not given. The undesirability of results based upon two kinds of 
tests is pointed out. 

The r’s for teachers’ estimates are apparently computed by a broad 
category method (5 categories). 
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ra@: Teachers’ Estimate 
r PLE r ies 13) 
Brother=brother: some ne cec ey. .49 04 .49 .04 
Sisterssicterier shu sales amine ee .42 .04 .58 .03 
Above, corrected for age............ .38 .04 .54 .04 
Brother-sister nnc6 1h tee croleoee  ak .53 .02 .49 OZ 
Above, corrected for age............ .53 .02 .50 .02 


(3) Some data from Edinburgh are reported which show r’s of 
— .07 to +.16 between the intelligence of children and such factors as 
drinking of the father, drinking of the mother, morality of the parents, 
physical condition of parents, economic condition of the home (.10 for 
boys and .16 for girls), overcrowding. 

Comment: Drinkwater’s data and the Edinburgh data appear use- 
ful only in establishing relationships. They tell nothing as to which 
is cause and which effect. 


6. Goddard, H. H. Feeble-Mindedness: Its Causes and Con- 
sequences. New York, 1914, 599 pp. 


From 164 family histories of children at the Vineland Training 
School for Feeble-Minded who were considered to be deficient through 
hereditary causes, field workers’ reports upon the probable mental status 
of about 1700 individuals were analyzed. When members of the par- 
ent generations were classified according to probable Mendelian types 
of mating (upon the assumption that feeble-mindedness is a recessive 
unit character) the incidence of feeble-mindedness among the offspring 
fell approximately into Mendelian ratios. While the author is cautious 
in accepting a simple Mendelian explanation as established, he never- 
theless states that the conclusion is justified that feeble-mindedness be- 
haves as if such an explanation were valid. 


Comment: ‘There are significant deviations from the ‘expected’ 
Mendelian ratios in the case of several of the postulated types of ‘Men- 
delian matings,’ and among matings of two feeble-minded individuals, 
six out of 482 offspring are ‘normal.’ This latter could not occur, re- 
gardless of chance, if feeble-mindedness were actually a recessive unit 
character. But this discrepancy by itself does not entirely invalidate 
the unit-character hypothesis as applied to feeble-mindedness, since (a) 
some of the feeble-minded parents of the 482 offspring in question might 
be suffering from a secondary (1.e., acquired and non-transmissible) 
form of deficiency; and (b) the classification of some individuals as 
‘normal’ or ‘deficient’ may be in error. A more serious criticism of the 
conclusions is that, regardless of any approximation to Mendelian ratios 
which the types of offspring from given matings may offer, the hypothe- 
sis that deficiency, when it occurs below a certain arbitrary level upon a 
continuous distribution of human intelligence, is a unit character (14.e., is 
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due to a single gene) is logically untenable. It can be shown (see dis- 
cussion in Part I, Chapter II, of this Yearbook) that the ratios of 
feeble-minded and normal offspring found by Goddard could be ex- 
plained about as well by a hypothesis that is fairly well in accord with 
the observed facts of human distribution as by the untenable hypothesis 
of a single recessive gene for feeble-mindedness. 


7. *Gordon, Kate. ‘‘The influence of heredity in mental 
ability.”’ Children’s Dept. Calif. State Board of Control, 4th 
Bienn. Report. 1918-1920. 


See extension of this report summarized under Elderton, No. 5. 


8. Gordon, Kate. ‘‘Report on psychological tests of orphan 
ehildren.’’ Jour. of Delinquency, 4:1919, 46-56. 


See extension of this report summarized under Pearson, No. 13. 


9. Hart, Hornell. ‘‘Correlations between intelligence quotients 
of siblings.’’ School and Soc., 20: 1924, 382. 


In connection with test work of the Iowa Child Welfare Research 
Station, Army Alpha, National, or Stanford Binet tests were obtained 
from the following groups of siblings: 

A. Children aged 10 to 15 in Davenport, Iowa. 
B. Children of farmers in Sumner ‘'ownship. 
C. Children attending the University schools, Iowa City. 

When I.Q.’s were derived (occasionally from averages of combina- 
tions of tests), correlations between siblings were: 


Group r P.E N 
AG per eretenata re teeter ttole ts teceriahe wsfayolalise ait 6 45 .03 252 
Pyare cistaciasepaie bl cede slarstce te ele pei Guciahs 46 .07 147 
Oi Sark dia ie e cieea eaiele re et aes ae 40 06 219 


Comment: Averaging I.Q.’s from tests of varying types seems 
dubious. The ranges of the groups are not given. 

10. Heymans, G., and Wiersma, E. Beitrige zur speziellen 
Psychologie auf Grund einer Massenuntersuchung. Zeits. fir 
Psych., 42: 1906, 1-127, 258-301. 

See account and reduction of data by Schuster and Elderton. 

11. *Hildreth, Gertrude H. The Resemblance of Siblings in 
Intelligence and Achievement, N. Y., 1925, 65 pp. (Teachers Col- 
lege, Columbia University, Contributions to Education, No. 186.) 


The author analyzed results of tests given by schools and by an 
orphanage. The groups tested were: 
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A. Four hundred fifty sibling pairs from 300 families in a city in 
Oklahoma apparently representative of average American cities. These 
children were not unselected, however, since the schools tried to test only 
bright and dull children for special-class work, and hence dealt mainly 
with dull children. 

B. Three hundred twenty-five sibling pairs from 241 families. This 
group, who attended Horace Mann school, had a disproportionate num- 
ber of bright children. 

C. Two hundred fifty-three pairs representing 146 families. The 
records were obtained from a large Hebrew orphan asylum in New 
York City. Children in the asylum were received at any age above 
5 years. 

The range of I.Q.’s and ages for the three groups was as follows: 


I. Q. CAS 
Group = ae Sa Ral Sa dE eR TP ee 
Qi Med. Qs; Qi Med. Qs 
UNtecs hvaest Veter a Gcetet 72 88 103 ail 11-10 13-3 
Diploma ere th es 108 119 129 5-2 7-5 9-8 


Ohler chen Bacar 85 $6 105 8-0 9-11 West 


Stanford Binet records were available for all (excepting a few 
Oklahoma cases which had been tested upon other Binet revisions). No 
tests were made especially for the investigation. For some of the Okla- 
homa and Horace Mann cases Stanford Achievement Test data were 
available. Care was taken to eliminate children thought to be sibs who 
were not actually sibs. 

In the correlations all possible pairings were used, and the pairings 
were made at random—not oldest with youngest or brightest with dullest. 


CorRRELATIONS BetweEEN I. Q.’s 


Rawr Age partialled out 

r Pes we P. E N 
OkTanOMae ara we Accel ae — 68 .02 47 .02 450 
ino aos MENA Sones Ob ae S25; .03 .08 .04 325 
Hebrew Orph........... «32 04 13 04 253 
(Clove Ko sos che pa .68 .O1 -41 .02 1028 

Correlations Correlations 

Between E. Q.’s Between A. Q.’s N 

r P. E ie P.E 
Oka OM a Aacels cielo .58 04 Dh 06 105 
Horace: Mann: 2.2 ic. -41 06 16 07 83 
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To investigate the influence of environment upon the variability in 
I.Q., data from the orphanage were treated as follows: 


Mean IJ.Q. Sos N 

1. Unrelated children reared together: 
10— 25 percent of their lives.......... 96 ished 94 
50-100 percent of their lives.......... 93 135 94 
2. Unrelated children reared apart......... 96 13.6 200 
owe cue mM psireared apart. onc ven ae. 93 11.4 156 


True sibs reared apart yielded the correlations that follow: 


r P. HE: N 
Pll abire & Sie omer eee eereee et th, Pon 123 .07 78 
Oncipairweachsiamilly mesures alina cere es .24 .10 42 
All pairs corrected for curtailed range...... .495 .06 78 


Under just what conditions tests of the siblings reared apart were 
obtained is not reported. The average time apart is said to be four 
years. The separations occurred from birth to age 12. 

The author concludes: “True siblings reared apart for part of their 
lives still resemble each other about as much as true sibling pairs reared 
together. Unrelated children reared together for part of their lives 
resemble each other in intelligence no more closely than unrelated chil- 
dren chosen from the population at random.” 


Comment: The data suggest an explanation based partly on hered- 
ity, but the author admits that the data are inconclusive. There is no 
way of evaluating the influence of pre-institutional life upon the sibling 
resemblances. On the other hand, the low correlations between siblings 
reared in an orphanage suggest some environmental effect. It would 
be of interest to know whether the correlation of .32 for orphanage 
siblings (or of .13 when C. A. is partialled out) would rise if a correc- 
tion for range were applied. 


12. Madsen, I. N. ‘‘Some results with the Stanford Revision 
of the Binet-Simon Tests.’’ School and Soc., 19: 1924, 559-562. 


Sixty-three pairs of siblings tested by students of the author yielded 
a correlation of .63 £.05. The author writes: “Undoubtedly the cor- 
relation would have been larger if all errors in giving and scoring the 
tests could have been eliminated.” 


Comment: ‘The range of the subjects was not given. Unless un- 
usual selective influences were operative, .63 is a higher r than genetic 
resemblance of siblings should yield. This high value might possibly 
be due, however, to the inclusion of children older than the limit below 
which the Stanford Binet gives zero correlaticn with age, in which case, 
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if the C.A.’s of siblings were correlated, the mental-test correlation 
would be spuriously raised. 


13. Pearson, Karl. ‘‘Inheritance of psychical characters.”’ 
Biometrika, 12: 1918-19, 367-372. 


The article opens with the following comments: “In view of the 
papers that have been published on the inheritance of intelligence, it is 
strange that there should still remain any doubt that psychical characters 
are inherited at the same rate as physical characters. But having regard 
to the existence of that doubt, any material bearing on the point deserves 
special recognition and emphasis.” 

Taking Gordon’s first 91 orphanage sibling pairs (who have a mean 
I.Q. of 93 and S.D. of 16.7), he renders the table symmetrical and gets 
an r of .51 +.05, as compared with .53 obtained by Gordon. Appar- 
ently, her pairing was used, since he used her original table for his 
computation. 


Comment: These data are of course more crucial than ordinary 
data on family resemblance, but resemblance due to home influence in 
early life of the siblings remains a speculative possibility. 


14. Pearson, Karl. ‘‘On the laws of inheritance in man. II. 
On the inheritance of mental and moral characters in man.’’ Bio- 
metrika, 3: 1904, 131-190. 


The correlations quoted [see next page] are based on ratings or 
measurements by members of the family or teachers. The numbers are 
so large that P.E.’s are negligible. 


15. Pearson, Karl. ‘‘On the inheritance of the mental char- 
acters in man.’’ Proc. Royal Society London, 69: 1901, 153-155. 


Comment: The following conclusion, as enunciated in this paper, 
is quoted because it illustrates the extreme, and seemingly unwarranted, 
confidence that is sometimes drawn from data on family resemblance. 
“Dealing with the means for physical and mental characters, we are 
forced to the perfectly definite conclusion that the mental characters in 
man are inherited in precisely the same way as the physical.” See notes 
in this section on Spearman’s criticism of Pearson. See also the fore- 
going summary of Pearson’s 1904 article, setting forth in amplified terms 
the results of this brief preliminary paper. 


16. Pintner, R. ‘‘Training students in group intelligence test- 
ing.’’ Jour. Educ. Research, 9: 1924, 271-280. 


Three groups of siblings, aggregating 106 pairs, yielded correlations 
between .24 and .42. 


Comment: ‘The type of tests employed was not stated. 
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[To be read with Item 14 on p. 258.] 
ADULTS FROM 1,000 FAmiILiEes 


Correlation 
Character rs 
Brothers Sisters Brothers 
and Sisters 
DS CACUT Cerne eetetn te UNE te Aad tg A ea” .6ol .54 55 
SOOT aan dans hero eiere ea At Ae cleo tel SAsie's as 455) .56 53 
Gubit 2.15 Sead 9 ae ee hin gee eet a 49 sta 44 
BV OICOLOLA te oes oe Ree eek 52 -45 46 
(Meant ere vea re ter in ree en Ona Ay) abil. -49 
ScHooL OBSERVATION SERIES 
Correlation 
Physical Characters 
Brothers Sisters Rye 
Pronltn Poe anes. Sena IL ey 52 51 7 ae 
TE VCE COLON ers orteterels ohne teeters cece .54 52 53 
EL AITACOLOT aint see RE La ae .62 35/ .55 
@Wephalictindext tra teas este ses es: .49 .54 -43 
Headslensthaimyrey empresa tee ae on .50 .43 .46 
Headsbresdthnt twat tec ts cee te eres .59 .62 .54 
ead heighten caine foe ns feat 55 52 49 
MWieancteraireerin Gia cceeitn cote oral coal .54 53 51 
AthleticgpOW ete ae rine were eras (2 15 49 
ScHooL OBSERVATION SERIEBS—Continued 
Correlation 
Mental Characters 
Brothers Sisters ees 
VV ICLUY 2) eeepc tat rates Chote oars bridicetero a) are % AT .43 -49 
INSSEL LIV. ENESS Ur edtr. ores ENCE Ha Oiaitons s dceare aa} -44 .52 
nt rOsMee HOMME ipl ers tle tats eieie .59 AT .63 
Popularityn peteniio dere tee esto heise .50 57 .49 
CWonscientiousnessssee cee: .59 .64 .63 
fMemiperiieecicean shischian docks soca sictss 51 49 .51 
TN aK donee oA cote DOO Ee RTE ORA OS CoO .46 47 .44 
Eland waitin eaenecticta atte ceo ceecreaa: .53 56 -48 
hY AG hales We af Fig te a noe rete Mane eae now! Syl! ALY 


17. Pintner, R. ‘‘Mental indices of siblings.’’ Psych. Rev., 
25: 1918, 252-255. 

Yule’s coefficient-of-association technique, applied to mental indices 
of over 180 pairs of siblings, gave a value of .39. Indices were based 
upon results from 6 tests ranging from opposites to cancellation. The 
method of deriving the indices is not stated. 
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18. Rensch, Grace E. A Study of 365 Pairs of Siblings with 
Reference to the Influence of Heredity on Mental Ability. (Stan- 
ford M. A. Thesis, 1921, 35 pp.) 


The Stanford Binet or the Terman Group Test was applied to 
groups of Palo Alto and San Jose siblings ranging from the first to the 
twelfth grade. Barr occupational ratings were also obtained for the 
fathers or for the mothers if the fathers’ occupations were unknown. 
The average of correlations between siblings of various sub-groups was 
.45. The average of correlations between I.Q.’s of children and Barr 
ratings of fathers (or mothers) was .33. 

Comment: Nothing was said of the possibility that occupational 
‘stratification’ of various races represented by the subjects might partly 
account for the rather high correlations between Barr ratings and I.Q.’s. 
The justification for combining in single correlations I.Q.’s based upon 
two different tests is doubtful. 


19. Schuster, E., and Elderton, Ethel M. ‘‘The inheritance of 
psychical characters.’? (Summary of Heymans and Wiersma.) 
Biometrika, 5: 1906-1907, 460-469. 


A summary and reworking of data of Heymans and Wiersma is 
presented. Three thousand Dutch physicians were sent a questionnaire 
asking for family ratings upon 90 psychial characters. More than 400 
of these were returned. Correlations between fathers and sons and 
daughters, and between mothers and sons and daughters averaged be- 
tween .20 and .30. 


Comment: The study has historical importance, but the lack of 
standardization in ratings by the judges renders its correlations 
ambiguous. 


20. Spearman, C. ‘‘The proof and measurement of association 
between two things.’’ Amer. Jour. Psych., 15: 1904, 72-101. 


It is pointed out that correlations between psychic characters based 
upon teachers’ judgments, as reported in Pearson’s work, must be at- 
tenuated by unreliability of judgments. The reliability of such judg- 
ments is not usually above .64, which would bring Pearson’s reported 
correlations of .52 up to .81. However, attentuation may be partly bal- 
anced by ‘dilation’ owing to restrictions of cases to brothers attending 
the same schools. Those with different qualities are perhaps at sea, 
working in offices, etc. 


21. Woods, F. A. Mental and Moral Heredity in Royalty. 
New York, 1906. 312 pp. 


Members of the royal families of Europe, extending from the pres- 
ent back to about the 16th century, and even to the 11th century (Spain 
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and Portugal), were studied. The discussion was chiefly confined to 
individuals who had reached thirty or over. 

By consulting biographical dictionaries and lexicons, Woods graded 
all individuals upon intellect and moral quality on a scale descending 
from 10 to 1. The ratings fell in a fairly normal curve. 

Mental and moral ratings on 608 individuals correlated .34 + .04 
by Pearson’s four-fold method (division at the mean). 

The following four-fold correlations were found, and compared 
with what Woods erroneously takes to be the theoretical r’s. 


Mental Ability 


N Obtained r |Theoreticalr 
Father and offspring.............-+..: 504 31 .30 
Grandfathers and offspring............. 952 .16 15 
Great grandfathers and offspring (ap- 
parently refers to the maternal grand- 
father plus two great grandfathers)... 1,179 15 .075 


It is pointed out as a significant fact that the correlation is as high 
on the maternal side as on the paternal, although maternal grandfathers 
must represent different courts, and hence different milieus from those 
in which the offspring are reared. 

The conclusion is that, since the grandparents . . . “furnish 
their full quota of resemblance,” although the environment is likely to 
be different, “we must then consider the resemblances to be brought 
about through the germ-cells alone.” 

It is shown that inheritors of thrones whose position might 
help them to gain intellectual distinction appear in the top levels no 
more frequently than younger brothers. 

Comment: Environment is not ruled out in any rigorous sense, 
although the environment of the entire group is no doubt superior and 
somewhat uniform. ‘Thus, the results almost surely demonstrate some 
influence of heredity, since differences in environment could not account 
for the large differences in ability. 


b. Twins. The fact that twins resemble one another more than 
ordinary siblings do, and that identical twins resemble each other 
more than fraternal twins do, is strong presumptive evidence in 
favor of heredity. However, we have no way of telling whether 
or not part of the greater resemblance in twins may be due to a 
greater similarity in the environment provided for twins than for 
ordinary siblings. Similarly, in the comparison of identical and 
fraternal twins, we do not know whether the environment of the 
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former may not be more similar than that of the latter. Further- 
more, even if the environment for siblings, and for both types of 
twins could be assumed equivalent, it would be difficult to say just 
what interpretation to put upon the difference in correlation be- 
tween identical and fraternal twins; e.g., how much has heredity 
or environment affected scores if the correlation for fraternal twins 
is .00 and that for identical twins is .85? 


22. Galton, Francis. Inquiries into Human Faculty. London, 
1883, 887 pp. (History of twins, pp. 216-243). 


Data were collected by the author through written inquiries and 
reports from 35 twin pairs of close similarity and 20 pairs of early 
dissimilarity. 

The study is not reported statistically, but many excerpts from com- 
munications are reproduced. In the “similar” group a dozen cases are 
mentioned in which some sickness apparently caused a pair to grow dis- 
similar in some traits (intelligence?). In the few cases in which twins 
similar when young grew diverse “where no external cause can be as- 
signed . . . we may feel sure that it must be chiefly or altogether due 
to a want of thorough similarity in their nature. . . . In some cases the 
resemblance of body and mind had continued unaltered up to old age, 
not withstanding very different conditions of life.” 

In the “dissimilar” group, the author has not a “single case in which 

. correspondents speak of originally dissimilar characters having 
become assimilated through identity of nurture.” 


23. *Merriman, Curtis. The Intellectual Resemblance of Twins. 
(Psych. Monographs, 33: 1924, No. 5, pp. 58). 


The entire twin population of Grades I to VIII in certain cities 
was studied. Data were secured for 105 pairs on the Stanford Binet, 
76 pairs on Army Beta, and 143 pairs on the National Intelligence Test. 
A summary of the chief correlations on these three tests is presented 
herewith. 


PA ae ised sete Guan vee 78) 302; 105)) .84> 302) 76] .89) 01 9143 
INU aaa noo AOmGOD Eo Onae SI Oommen ale Se OO mE Sa eS Omer OS mod 
ALLL O-UGO Ree a cieretersredete store 76 .04 58] .66 .05 48] .88 .02 89 
MC CRSOXee alereletelerernrer a ecoher= Sg O02 men O del iO Lau: O2 me | MeO 2 sae (Nee 2 

Dae cinieieie ole aereleretaveratocye ston by PAN BEAN 507 2 aK} || GS OT Ba 
OE U Giro ctetarctevete ielereretteraor SO eer USMENOS S| ee o 40m OLIN ll me Soumee Oc samo 
Win lik C:SOX Pe iialsre's enetejeiere SOMOS Oona .oo OG MSL |r (anes OSM ADE 
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The author points out that there are higher correlations between 
like-sex pairs than between unlike-sex pairs, that there are no signifi- 
cant differences between the correlations of twins of different ages, that 
special study of pairs reported as “similar” shows even greater likeness 
than that existing in the total group of like-sex pairs, and that cor- 
relations between twins of unlike-sex are about equivalent to those 
usually found for ordinary siblings. An explanation of results in terms 
of heredity is suggested. 


Comment: Apparently the scores with the Beta and the National 
tests were not corrected for age, which, of course, would spuriously 
raise these correlations. (See summary of critique of this study by 
Eugene Shen, No. 26.) 


24. Muller, H. J. ‘‘Mental traits and heredity.’’ Jour. 
Heredity, 16: 1925, 433-448. 


Two young women of 30, identical twins, who were separated at 
two weeks of age and did not see each other until age 18, were tested 
by the Army Alpha, the Otis Self-Administering Test, and several per- 
sonality tests. Physical measurements were also made of the twins. 
Their scores on Alpha were 156 and 153; and on the Otis test, 64 and 
62. Though both had been brought up in the country, one had had 
only four years of formal schooling, including 9 months in business 
college; while the other had gone through the high school and had 
taught school until her marriage 9 years ago. 

Their similarity on the mental tests was most striking, on the sepa- 
rate test elements hardly less than upon the total scores. On the other 
hand, their performances on the personality tests showed no unusual 
similarity. 

Comment: ‘The results with the mental tests have decided sig- 
nificance; but those on the personality tests are hard to interpret be- 
cause of the unreliability of the tests used. 


25. Popenoe, Paul. ‘‘Twins reared apart.’’ Jour. Heredity, 
13: 1922, 142-44. 


These are the twins discussed in detail by Muller in 1926. This 
account contains no test data, but presents brief histories of each twin 
and anecdotal reports concerning simultaneous activities of each with- 
out the knowledge of the other—as sickness, bobbing their hair, etc 
Friends cannot tell their voices apart—or their faces, usually. 


26. Shen, Eugene. ‘‘The intellectual resemblance of twins.’ 
School and Soc., 21: 1925, 601-602. 

Merriman’s conclusion that environment appears to make no sig- 
nificant difference in the amount of twin resemblance is discussed crit- 
ically. Merriman’s data were not entirely crucial because: 
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(1) Reliabilities were not reported so that differences between 
pairs could be separated into true and chance differences (by correction 
for attenuation). 


(2) Age was not allowed for in the group tests—thus giving rise 
to a spurious correlation. F 

(3) The range of the scores for different groups was not reported; 
consequently, the r’s are not strictly comparable. Shen has made the 
data for the various groups comparable by computing the I.Q.’s and 
standard errors of estimate in each case from data in Merriman’s ap- 
pendix. The resulting standard errors of estimate show a quite: con- 
sistent increase of 3 points from younger to older groups in all twin 
classes. 

This might be explained (but is not through Merriman’s treatment) 
by (1) decreasing reliability of the I.Q., (2) increasing variability of 
the [.Q., or (3) increasing effect of environment from lower to upper 
ages. If (1) and (2) are not sufficient to account for it, then (3) must. 


27. Thorndike, E. L. Measurements of Twins. (Columbia 
Univ. Contrib. to Phil. and Psych., 13: 1905, 64 pp.) 


A battery of six mental tests (such as cancellation and opposites) 
and of eight physical measurements was applied to 50 pairs of twins 
aged from 9 to 15. The average correlation, corrected for attenuation, 
of scores on the six mental tests (in terms of age norms) was .78. The 
average of the correlations for the eight physical traits (measured in 
terms of age norms) was .76. The author finds that the young twins 
of the group are as much alike as the older ones, that the twins resemble 
each other more than the ordinary siblings who took the test, and that 
resemblance is as strong in relatively untrained capacities as in trained 
ones. He thereby concludes that the similarities are due to inborn 
capacity. 

Comment: Data are also presented which the author believes 
demonstrate that all twins are of only one type (rather than of both 
identical and fraternal types), but the arguments rest upon such an un- 
certain statistical basis that they can not be accepted in the light of 
more recent studies. 


e. Incidence of Genius or Feeble-mindedness in Family Lines. 
This section is of historical importance chiefly. The studies sum- 
marized here call attention in a forcible way to family resemblance, 
but fail to furnish quantitative statements of such resemblance. 


28. *Brimhall, D. R. ‘‘Family resemblances among American 
men of science.’’?’ Amer. Nat., 56: 1922, 504-547, and 57: 1923, 137- 
152, 326-344. 
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Questionnaires obtained from 956 of Cattell’s starred group of 1000 
men of science, combined with information from biographical sources, 
yielded the following data: 

Brothers of these men are 72.8 times as likely to be in Who’s Who 
as are brothers of the generality, and sisters are 70.8 times as likely to 
be there as are women of the generality. A brother is twice as likely 
to be distinguished as a father, four times as likely as an uncle, and six 
times as likely as nieces and nephews. 

If relatives are taken no more remote than cousins and if both the 
father and the mother are omitted, there are 117 distinguished relatives 
on the paternal side, of whom 30.8 percent were distinguished in science; 
and 110 on the maternal side, of whom 36.3 percent were distinguished 
in science. an 

Comment: This latter result is perhaps the most significant one 
of this study, since the maternal influence was equal to the paternal in- 
fluence, even though the economic status of the immediate families must 
have been chiefly determined by the fathers. 


29. Cattell, J. McKeen. A Statistical Study of American Men 
of Science: a Biographical Directory. New York, 1921, 27 pp. 3d 
Edition. Reprinted from Science, N. 8., 24: 1906, 658-665, 699-707, 
732-742. 

American men of science had far more professional men among 
their fathers than an equivalent number of men of the generality. (See 
Brimhall’s study upon the heredity of men from Cattell’s group.) 

30. Danielson, Florence H., and Davenport, C. B. The Hill 
Folk. L.1., N. Y., 1912, 56 pp. (Eugenics Record Office Memoir, 
No. 1.) 

A family connection called the Hill Folk abounded in feeble-minded 
and alcoholic members. 

31. Dugdale, R. L. The Jukes; a Study in Crime, Pauperism, 
Disease, and Heredity. New York, 1877, 120 pp. 

Of 1200 descendants of a shiftless truant born in 1720, only 20 had 
learned a trade. The genealogy was replete with paupers, murderers, 
and enemies of society. 

32. Estabrook, A. H. The Jukes in 1915. Washington, 1916, 
85 pp. (Carnegie Institution of Washington, Publication No. 240.) 

The author brings the study of the Jukes made by Dugdale in 1877 
up to date. The family stock had not improved. 

33. Estabrook, A. H., and Davenport, C. B. The Nam Family. 
L. L., N. Y., 1912, 85 pp. (Eugenics Record Office Memoir, No. 2.) 
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In the family Nam, 90 percent of the men and 80 percent of the 
women were alcoholic or shiftless. 


34. Galton, F. Hereditary Genius. London, 1869, 390 pp. 


Nine hundred and seventy-seven eminent men each defined as the 
most eminent of 4000 random persons had 535 eminent relatives as close 
as grandfathers or grandsons. Nine hundred seventy-seven men se- 
lected by chance would have only four such relatives. 


35. Galton, F. ‘‘Distribution of successes and of natural 
ability among the kinsfolk of fellows of the Royal Society.’’ 
Nature, 70: 1904, 354-356. 


Family schedules obtained from 200 Fellows of the Royal Society 
showed a large number of relatives as distinguished as the Fellows 
themselves. There was a consistent dropping off in numbers of such 
relatives as relationships became more remote. 


36. Goddard, H. H. The Kallikak Family. New York, 1912, 
121 pp. 


Two family lines established by a soldier in the Revolutionary War 
were compared. One line established by a feeble-minded woman con- 
tained 480 direct descendants among which only 46 normal individuals 
were found. Among’ 496 direct descendants of the line established by 
a normal woman, all were normal with the exception of five. 


37. Jorger, J. Psychiatrische Familiengeschichten. Berlin, 
1919, 494 pp. 


Gives account of the notorious family Zero. 


38. Winship, A. E. Jukes-Edwards: A Study in Education 
and Heredity. Harrisburg, 1900. 88 pp. 


Fourteen hundred descendants of the Jonathan Edwards family con- 
tained a preponderant number of college graduates, professors, and lit- 
erary men. No mental defectives were traced in the family line. This 
line is compared with the notorious Jukes family. 


39. Woods, F. A. ‘‘Heredity and the Hall of Fame.’’ Pop. 
Sci. Monthly, 82: 1918, 445-452. 


Biographical dictionaries list about 3500 eminent Americans. The 
average man has only one chance in 500 to have an eminent man for 
a relative as close as uncle or grandson. But one in five of the 3500 
Americans has a relative among the 3500. Of the 46 Americans in the 
Hall of Fame, more than one in two has an eminent relative. The 
author concludes that something like one percent of the population is 
as likely to produce a man of genius as all the rest of the population. 
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d. Animal Studies. Since the environment for laboratory ani- 
mals is as constant as it is possible to have it, family resemblance 
in animals almost surely indicates hereditary transmission of traits. 
Studies of learning capacity only are summarized below, since the 
few other studies in the literature are concerned with inheritance 
of temperament or of acquired characteristics, two topics not 
touched upon in the body of this summary. 


40. *Bagg, H. J. Individual Differences and Family Re- 
semblances in Animal Behavior. New York, 1920, 58 pp. (Archives 
of Psych., No. 43.) 


One hundred eighty-three white mice and yellow mice were given 
one trial a day in a simple maze requiring only two choices. Seventeen 
trials were given, of which only the last 15 were used in the results. 

The yellow group spent an average time of 83 +7 seconds, with 
an average of two errors. The white group spent an average time of 
27.5 + 2 seconds, with an average of 0.9 error a trial. “The yellow mice 
were thus found to take on the average at least three times as much 
time and to make twice as many errors as did the white mice. 

Data upon maze records for six or seven generations lead the author 
to the conclusion that “there appears to be no particular resemblance 
among individuals of the same litter, as judged by their time-records 
for various tests.” The results with mice of the white and yellow 
strains, however, suggest “a considerable difference among strains.” 

Comment: The study is of importance in demonstrating gross dif- 
ferences in maze-running ability for different strains of rats. The maze 
was too short, and hence too unreliable, and the character of inbreeding 
too uncertain to make an interpretation of the results by biometric or 
genetic methods possible. 


41. Burlingame, Mildred. ‘‘Literature on the heredity of be- 
havior traits in animals.’’ Psych. Bull., 24: 1927, 62-68. 


Summarizes the chief studies briefly, including temperamental traits 
as well as of learning capacity. 


42. *Tolman, E. C. ‘‘The inheritance of maze-learning ability 
in rats.”? Jour. Comp. Psych., 4: 1924, 1-18. 


By selective breeding two independent strains of rats were estab- 
lished, one ‘bright’ and the other ‘dull.’ From a parent generation of 
41 males and 41 females, 9 pairs of bright rats and 9 pairs of dull rats 
were selected upon a rough pooling of time, errors, and number of per- 
fect runs (on a maze 44 by 44 inches on the outside.) 


Sixteen pairs from 18 litters were the parents of a third generation. 
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The animals were given 10 trials on successive days. Omitting 
runs I and 2, the reliabilities based upon odd versus even runs were 
about .38 for errors and .59 for time. 

Correlations between mid-parent and mid-offspring for the three 
generations were as follows: 


Errors Time Perfect runs 
Repviss iy CLS seteatee recs es auerdis an ces .46 .30 .86 
Baavstabou(lG) au wate eae cee ROL — .27 .78 


Means and variabilities of the ‘bright’ and ‘dull’ of the three gen- 
erations were also reported and showed distinct differences between 
the bright and dull strains in the second (F,) generation, but not in the 
third (F,). The author suggests that the convergence in F, may be due 
“to the strict inbreeding which was practiced or . . . the respective ages 
at which the animals were tested.” 

Comment: The study shows distinct evidence for heredity, but 
leaves us in the dark as to the strength of influence of heredity. The 
correlations are ambiguous because they are based upon populations 
from the extremes of the normal distribution; and they are computed 
with the averaged scores of offspring as one variable, which spuriously 
raises them. 


2. Intelligence and Social Environment 


The bearing of studies in this field is indirect unless measured 
changes in environment are shown to accompany changes in mental 
status. Hierarchies of intelligence in social environments of vari- 
ous grades may or may not mean environmental influences; they 
seem at least as likely to indicate the influence of intelligence (of 
individuals or of their parents) upon environment. 


a. Occupational Status. 


43. Alexander, H. B. ‘‘A comparison of the ranks of Ameri- 
ean states in Army Alpha and in social economic status.’’ School 
and Soc., 16: 1922, 388-392. 


Army Alpha rank orders of states (based on white drafted men) for 
44 states are correlated with rank orders in other variables. Rank cor- 
relations are transmuted to r by table. The results follow: 

The author concedes that the results may be interpreted either in 
favor of heredity or environment, but concludes that “Army Alpha ap- 
pears as a test of what has been learned rather than what can be 
learned.” 
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Alpha Rank Order With: r BeBe 

Ratio of men in 1912 Who’s Who to population of states 
Ura SOOM wamiere eee eae hcer ay cael Mer meen mares bs aha les 09 .05 

Percent of foreign-born whites in 1910 census (author 
finds this result cunprising) vs\\e. . 2h fetes ee ech eres 65 06 
Percent urban population in 1910... ....6.006.. 60002 ao .62 07 

Percent owning homes in states having over 50 percent 
rural populationewers pce Oks oo eee ee eek: .68 BOG 
Percent of farms owned by white men.................. STA -05 
average Wagovor farm labor. 9) :.. sme. acetee oly. o> .83 .03 
Hiteracye Ols wines Meee hier Ceca oc.crok Lome heute s: .64 .06 
Ayresischoolndexcmel G00 nn eerie ene iat ee Sean eee ~12 .05 


Comment: ‘The fact that these high correlations were based upon 
group averages must, of course, be kept in mind. This fact causes the 
actual relationships to be greatly exaggerated. (Sce Chapter II of this 
Yearbook, on “Statistical Hazards.”) The causal interpretation is un- 
watranted. 


44, Barr, Frank E. A Scale for Measuring Mental Ability in 
Vocations and Some of its Applications. (M. A. Thesis, Stanford 
University, 1918, 90 pp.) 


The author describes the derivation of a scale based on estimates 
of twenty judges upon the order of 100 occupations in the demands 
made on mental ability. The r of 10 against 10 judgments on the 100 
occupations is .go and .95 by two arrangements. P. FE. is .or. 

I.Q.’s of children in various schools (Palo Alto, Mayfield, Oakland, 
San Jose, Santa Clara) are correlated with Barr ratings of fathers on 
a 14-point division of the scale. On the Stanford Binet, r’s at different 
schools are .69 + .04; .41+.07. With lumped results of Binet and 
some kind of “reliable group test,” Barr ratings correlate .50 = .03 for 
234 cases. 

Comment: It seems possible that these r’s may be raised by foreign 
elements in the population. It is a dubious procedure to lump I.Q.’s 
from two types of test. 


45. Book, W. F. The Intelligence of High-School Seniors. 
New York, 1922, 371 pp. 
The percentages of high-school seniors scoring above the group 


medians were tabled according to the occupational group of their par- 
ents with the following results: 


Protessionval?.s2 otasae/ ages 5.061% OOat-}Hxecutivie {ci ieitsot seus orn et 54 
CGlericala eratpatinlam tae wisi a 60." Day Laborer. ih Pat se cats «eis. 47 
Galesmen citelere wae tesdeie vices s COmm@ Larimer sei kles holes casas b/s 43 
IN TULSAM sesuedeRe erates Accel le ants 55 


City children were also found to score higher than country children. 
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46. Clarke, E. L. American Men of Letters. New York, 1916, 
169 pp. (Columbia University, Studies in History, Economics, and 
Public Law, No. 168.) 

Of 666 American men of letters, 49 percent were descended from 
professional classes and only seven percent from mechanical, clerical, 
and unskilled labor groups. 

47. Cox, Catharine M. Genetic Studies of Genius, Vol. 2. The 
Early Mental Traits of Three Hundred Geniuses. Stanford Uni- 
versity, 1926, 842 pp. 

The occupations of fathers and maternal grandfathers of 282 of the 


most eminent historical personages living between 1450 and 1850 were 
distributed on the Taussig Scale as indicated below: 


Fathers Maternal Grandfathers 
Taussig Rating and Percent 
Classification Freq. | Percent| Freq. a of 182 
9 reported 
1. Professional and nobility ... 148 52.5 77 27.3 41.8 
2. Semi-professional, higher 
business, and gentry..... 81 28.7 65 23.0 35.3 
3. Skilled workmen and lower 
DUSINeSSies eae eon nie 37 oa 35 12.4 19.1 
4° Semi-skilled. 3.50. Mec ec 11 3.9 3 tay 6 
OM mOnskilled ars carsropmace er 3 Iboil 4 1.4 2) Pe 
INO TeCOrdanseieeceredciecne 2 0.7 98 SARTO tee eas 
otal eee etree ate: 282 100.0 282 99.9 100.0 


(See also the section on constancy of I.Q. for summary of other 
significant details of this study.) 


48. Dashiell, J. F., and Glenn, W. D. ‘‘A reéxamination of a 
socially composite group with Binet and performance tests.’’ Jour. 
Educ. Psych., 16: 1925, 335-340. 


About a dozen children in each of three groups were tested by the 
Stanford Binet and by six tests from the Pintner-Paterson Performance 
Scale with results noted herewith. 


Binet I. Q. Performance (points) 
Groups ss 
Av AnD: Av A.D 
Children of faculty members ... Bb 9.0 156 38.2 
LOWE ee erate ate in 100 ffeil 103 Pa fisd\ 
COUNTY Aree ree ee ierse fie 94 8.1 a iti 22.3 
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The authors suggest that the shift in rank of town and country 
groups on the performance scale may be due to “a difference in the 
nature of the two kinds of tests as respects the exact traits measured” 
or to the possibility that capacities of the town and country children 
are differentiated, “not by fundamental nature,’ but by “previous ex- 
perience with the types of things called for in the examination.” “As 
for the faculty group, it is clear that here is a distinctly abler type of 
child.” The authors admit that the data are too few to be conclusive. 

Comment: There is no explanation why the faculty group is equally 
high on both tests, while the other groups shift places. 


49. Dexter, Emily 8. ‘‘The relation between occupation of 
parent and intelligence of children.’’ School and Soc., 17: 1923, 
612-614. 

Tests of 2782 children in Madison (Dearborn Test in Grades I to 
III and National Test in Grades IV to VIII) showed average I.Q.’s 
ranging from 115 for those from the professional classes down to 89 for 
those from unskilled laboring class. 

Taking the average I.Q.’s of children of fathers of definite occu- 
pations, and the average Army Alpha scores of men of the same occu- 
pations from three camps, the rank order correlations were .74, .79, 
and .79. 

Comment: ‘These high correlations must, of course, be interpreted 
as based upon data from group averages. 

50. Ellis, Havelock. A Study of British Genius. London, 
1904, 300 pp. 

One hundred three British men and women of genius were studied. 
They were descended predominantly from upper-class and professional 
families. 

51. English, H.B. An Experimental Study of Mental Capaci- 
ties of School Children Correlated with Social:Status. New Haven, 
1917. (Yale Psychological Studies, Psych. Monographs.) 

Sixty-five percent of a group of lower-class children fell below the 
median of the total group (which included the lower-class children). 


52. Haggerty, M. E., and Nash, H. B. ‘‘Mental capacity of 
children and paternal occupation.’’ Jour. Educ. Psych., 15: 1924, 
559-572. 

In a New York mental-test survey, high-school and elementary- 
school pupils tested on Haggerty Delta 2 showed a consistent drop in 
median level from professional to unskilled laboring classes. 

58. Pressey, S. L., and Ralston, R. ‘‘The relation of the gen- 
eral intelligence of school children to the occupation of their 
fathers.’’ Jour. Appl. Psych., 3:1919, 366-373. 
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When children were grouped according to the occupations of their 
parents, the following percentages scored above the median of the group: 
professional, 85; executive, 68; artisan, 41; laborer, 39. 

54. Rensch, Grace E. A Study of 365 Pairs of Siblings with 
Reference to the Influences of Heredity on Mental Ability. (Stan- 
ford M. A. Thesis, 1921, 35 pp.) 


See summary in section on family resemblance. 


55. Sandiford, P. ‘‘Paternal occupations and intelligence of 
offspring.’’ School and Soc., 23: 1926, 117-119. 


Tests of over 5000 high-school, normal-school, and university stu- 
dents in British Columbia on a modified Army Alpha Test showed dif- 
ferences when students were classified by paternal occupation; but 
median I.Q.’s did not vary so much from class to class as in many other 
similar studies. Six hundred fifty-nine subjects from professional 
classes had a median I.Q. of 105.1, and 456 from ‘unskilled’ classes had 
a median [.Q. of 100.8. The author thinks the lack of differentiation 
may be due to ‘restriction’ at the upper end of the scale. 

Comment: Lack of differentiation is possibly due also to selection, 
since all subjects were in, or beyond, the high school. 


56. Terman, L. M., and others. Genetic Studies of Genius; 
Vol. 1, Mental and Physical Traits of a Thousand Gifted Children. 
Stanford University, 1925. 648 pp. 


The fathers of 560 children of the main group of the Stanford study 
of gifted children were distributed on the Taussig Scale in the follow- 
ing percentages: 


Prokessionaleee wiser c sees ae Bras Whilied labor. wictacwew sss a 11.8 


Semi-professional and_ busi- Semi-skilled to slightly 
NESSH ME Reese ies Cote teres 50.0 skilleda were sete: 6.6 
(a) SElicher eroup.. s+... 931.2) Common labor. Geis hee 0.13 
(bj Gower: TOUDs. <0 65s 18.8 


The author says: “Our data show that individuals of the various 
social classes present differences in early childhood—a fact which 
strongly suggests that the causal factor lies in original endowment rather 
than in environmental influences.” 

57. Visher, 8S. S. ‘‘A study of the type of the place of birth 
and of the occupation of fathers of subjects of sketches in ‘Who’s 
Who in America.’’?’ Amer. Jour. Soc:, 30: 1924-1925, 551-557. 
Also in Who’s Who, 1924-1925. 

The study is based upon persons in Who’s Who, 1924-1925. “In 
proportion to population at the 1870 census, cities contributed nearly six 
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times as many as did farms; villages, nine times as many, and suburbs 
eleven times.” Fathers who were in professional, business, farming, 
skilled or semi-skilled, or unskilled laboring classes contributed sons in 
the proportions 1400, 600, 70, 30, and 1, 

The author comments: “This study does not indicate whether 
‘nurture’ or ‘nature’ . . . is the more important.” 

58. Yerkes, R. M. (Editor) Psychological Examining in the 
United States Army. National Academy Science Memoirs, 15: 
1921, Pt. 3, 890 pp. (Chapter 15. ‘‘Intellectual ratings of occu- 
pational groups.’’) 

Data based upon average Alpha and Beta scores upon men from 


II4 occupations, in 16 camps, show pronounced decrease from the pro- 
fessional to laboring classes of occupations. 


b. Social or Cultural Status. 


59. Bickersteth, M. E. ‘‘The application of mental tests to 
children of various ages.’’ Brit. Jour. Psych., 9:1919, 23-73. 
City and country data. The conclusion is drawn that city children 


are slightly better on reasoning, country children slightly better on 
memory. 


Comment: The possibility that racial differences or selective 
factors, rather than city or country differences, may account for the 
results gives rise to some ambiguity. 

60. Bridges, J. W., and Coler, Lillian E. ‘‘The relation of in- 
telligence to social status.’’ Psych. Rev., 24:1917, 1-31. 

Children in a ‘favored’ Columbus school score higher, age for age, 
on the Yerkes-Bridges Point Scale, than those in an ‘unfavored’ school. 
The authors admit that the results “could be used by adherents of the 
‘environment theory’ as well as by advocates of ‘inheritance.’ ” 

61. Chapman, J. C., and Wiggins, D. M. ‘‘Relation of family 
size to intelligence of offspring and socio-economic status of 
family.’’ Ped. Seminary, 32:1925, 414-421. 

For 632 children, including many foreigners, in Grades VI to VIII, 


Chapman-Sims social ratings and National Intelligence Test I.Q.’s were 
obtained. The correlation was .32 = .o2. 


Comment: ‘The presence of many low testing foreigners may ac- 
count in part for this correlation. 

62. Decroly, O., et Degand, J. ‘‘La mesure de 1’intelligence 
chez les enfants normaux d’aprés les tests de MM. Binet et Simon.”’ 
Archives de Psych., 9: 1910, 81-108. 
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Children of good social class in Belgium tested considerably above 
Binet’s norms. 


63. *Duff, J. F., and Thomson, G. H. ‘‘Social and geographical 
distribution of intelligence in Northumberland.’’ Brit. Jour. 
Psych., 14: 1923, 192-198. 


About 13,000 children, representing practically all those over 11 
and under 13 in certain elementary and secondary schools of Northum- 
berland, were tested on the Northumberland Mental Test. The aver- 
age I.Q.’s of pupils from various classes varied from 121 (children of 
clergymen) down to 91 (hawkers’ and chimney sweeps’ children) and 
88 (children of insane and criminals.) 

Results classified as to locality tend to bear out an earlier sugges- 
tion that children of the cities and remote rural sections test higher 
than those in ordinary rural sections, thus showing that the low aver- 
age scores usually found for rural children may be due to selection 
rather than to environment. 


64. Fukuda, T. ‘‘A survey of the intelligence and environ- 
ment of school children.’’ Amer. Jour. Psych., 36: 1925, 124-139. 


Stanford Binet Tests and environmental scores (based upon a com- 
bination of the Whittier Home and Neighborhood scales) were obtained 
for 257 elementary-school pupils. The median I.Q. was 93. The cor- 
relation between I.Q. and environmental score was .53. 


Comment: Part of this correlation may be due to the presence of 
129 foreign children. 


65. *Galton, F. Hereditary Genius. London, 1869, 390 pp. 


The author writes: “The practice of nepotism among ecclesiastics 
is universal. (Among Popes and other Roman Catholic dignitaries). 
It consists in their giving those social helps to a nephew, or other more 
distant relative, that ordinary people give to their children. . . . I do not 
profess to have worked up the kinships of the Italians with any especial 
care, but I have seen amply enough of them to justify me in saying that 
the individuals whose advancement has been due to nepotism are curi- 
ously undistinguished.” 


66. Gaw, Frances. ‘‘A study of performance tests.’’ Brit. 
Jour. Psych., 15: 1924-25, 374-392. 


Twenty-seven canal-boat children between ages 6 and 13 were tested 
by the Stanford Binet and a battery of 14 performance tests. Binet 
I.Q. and performance I.Q. correlated .68. Educational ratios based 
upon Burt and Ballard tests correlated .26 with performance I.Q. and 
.58 with Binet L.Q. 


Comment: ‘The author states that the results “seem to justify the 
conclusion that the performance tests are less influenced by environ- 
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ment and by lack of schooling than is the Binet scale.” She does not 
suggest the possibility that Binet may be merely a better index of capac- 
ity to learn. 


67. Goodenough, Florence L. ‘‘The relation of the intelligence 
of pre-school children to the education of their parents.’’ School 
and Soc., 26: 1927, 54-56. 


“A total of 213 children, all of whom were between the ages of 18 
and 54 months, were given the Kuhlmann Binet test twice with an inter- 
val between the tests of approximately six weeks. . . . The correlation 
between the I.Q. earned on the first test and the education of the mother 
was .32 + .04; with the education of the father .26=.04. On the sec- 
ond test, the correlation with the mother’s education was .35 = .04; with 
the father’s education .35 = .04. . . . Since the influence of heredity is 
presumably exerted equally by the two parents, the close agreement of 
the correlations suggests that the relatively large proportion of time 
which the child of pre-school age spends with the mother as compared 
with that spent with the father has little effect upon the outcome of 
these tests.” 


68. *Gordon, Hugh. Mental and Scholastic Tests among Re- 
tarded Children. (Uondon Board of Education, Ed. Pamphlets, 
No. 44, 1923, 92 pp.) 


See section on Intelligence and Schooling for summary. 


69. Hirsch, N. D. M. ‘‘A summary of some of the results from 
an experimental study of the East Kentucky mountaineers.’’ Proc. 
Nat. Acad. Science, 13: 1927, Pt. 1, 18-21. 


The conclusion is drawn, in a way not made clear, that environment 
accounts for about 37 percent of the deficiency of the subjects. 


70. Hoffman, A. ‘‘Vergleichende Intelligenzpriifungen und 
Vorschiilern und Volkschiilern.’’ Zsch. f. angew. Psych., 8: 1914, 
102-120. 


One hundred fifty-six boys from a Vorschule and a Volkschule were 
tested by Bobertag’s Binet revision. Ten-year-old boys in the Volk- 
schule averaged no better than nine-year-olds at the Vorschule. 


71. Ide, Gladys G. ‘‘The increase of the intelligence quotient 
through training.’’ Psych. Clinic, 14: 1922, 159-162. 


Five girls are reported in a “private institution endowed for the 
education of fatherless girls’ whose 1.Q.’s rose after they entered the 
school. Two I.Q.’s rose as many as 16 points. 

Comment: Precise evaluation is impossible because of the lack of 
any control group, and the lack of any statement as to how many I.Q.’s 
failed to rise. 
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72. Isserlis, L. ‘‘The relation between home conditions and 
the intelligence of school children.’’ Medical Research Council 
Special Report Series, No. 74, 1923. 

From results of group tests and of teachers’ ratings of intelligence, 
made upon several hundred children, which correlated with environ- 
mental measures, such as social status, condition of clothing, and care 
of home, about .30 to .35, the author concludes that “progressive im- 
provement in home conditions may be expected to react favorably not 
only on the health, but also on the intelligence of school children.” 
Burt, in his preface to the work, says that the author “demonstrates 
that the influence of home conditions upon intelligence is a positive 
one.” But a few paragraphs later he says it “remains impossible to 
decide how far the child from the better-class home is profiting by mere 
environment. . . . and how far he is profiting from heredity, simply 
inheriting the high intelligence which had already guided his parents 
or his ancestors into a superior class.” (See critical notice by Pearson, 
No. 77.) 

73. *Jones, D. C., and Carr-Saunders, A. M. ‘‘The relation 
between intelligence and social status among orphan children.’’ 
Brit. Jour. Psych., 17: 1927, 3438-364. 


The Simplex Group Test (an English group mental test) was given 
to children and their I.Q.’s were computed. The pupils were 9 to 15 
years old. A total of 7oo or 800, from eight industrial schools or 
orphanages, appeared in some of the treatments. The basal age used is 
not stated.- The social origin of the children was rated on a scale very 
similar to the five-category Taussig Scale, on which Class 1 is highest. 

For pupils 9% to 14 in all the schools, the following mean I.Q.’s 
were found: 

1. Pupils in residence from 0 to 3 years 


OCCUpaIGN ClASSTL ATIC) 2 cists chaus-s +15 shel oictehass 107.5 

OccupationsGlassea men sre ctte tree ay siete 97 

Occupations ClassrApandasiee cee eel trons 94.5 
2. Pupils in residence over 3 years 

OccupationiGlass hand. 2/00 Sasi cnn. soem ee 106.5 

CCH PAMON CLASS ES t i GAO tacts cialdes srere uae oka 98.5 

Occtipation, Glasserandscuare terse ele 96.5 


Thus, the differences tend to decrease slightly with length of resi- 
dence, 0-3 years and over 3 years. 


Dirrerence IN Mean I. Q. Accorpine to LENGTH oF RESIDENCE 


I. Q. Greater In Earlier Period I. Q. Greater In Later Period 
Diff. | P.E. Diff. | P.E 
Occupation 1 and 2...... ON /ie | eSamh OCCUDSLION Oo aasmee ener 1.0 | 2.4 
Oicevpation Wee sie. -jeveiece « 4.3 1.8 | Occupation 4 and 5..... 0.6 IAF 
OCCUPATION se vin cise eis 5.9 3.0 | Occupation 4 and 5..... Wess |) abe 
Occupation 4 ond 5...... 0.7 2.5 | Occunation 4 and 5..... 2.1 1.8 
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The pupils are classified by schools and seem to show a differential 
effect of environment on different social classes. 

The authors conclude: “There is no reason to suppose, so far as 
this evidence is concerned, that environmental influences are the whole 
or even the major part of the cause of the differences in intelligence 
between children of different social origin.” 

Comment; The result of this study may be compared with interest 
with those of the Chicago and the Stanford studies of foster children 
appearing in this Yearbook. Environmental effects were found in these 
two studies of about the same magnitude as those in the present study 
(6 or 7 points). The possibility of chronological selective factors, which 
in orphanage studies of the past has not been ruled out, seems not to 
cloud seriously the resuits of this study, since the average I.Q. of one 
class of children is seen to increase with length of residence, and of 
another class of children, to decrease. 


74. Kornhauser, A. W. ‘‘The economic standing of parents 
and the intelligence of their children.’’ Jour. Educ. Psych., 9: 
1918, 159-164. 

Of the families of 1000 Pittsburgh school children, 32 percent had 
telephones. In families of children retarded in school one half year or 
more, 19 percent had telephones; 32 percent of families of children 
normally advanced, and 50 percent of families of accelerated children 
had them. 

75. Morlé, M. ‘‘L’influence de 1’état social sur le degré de 
l’intelligence des enfants.’’ Bull. Soc. Libre Educ. Psych. de 
V’Enfant, 12: 1911, 8-15. 

Thirty random children from a ‘favored’ school and 30 random 
children from an ‘unfavored’ school were compared. The children from 
the poorer section were, on the average, one-fourth year behind the 
level of their age. Those of the ‘favored’ school averaged from one- 
fourth to one-half year accelerated. 

76. Odin, Alfred. Genése des Grands Hommes. Gens de Let- 
tres Francais Modernes. 2 vols., Paris, 1895. 

French men of letters were relatively more often born in cities than 
in the country. The author believes that this fact indicates an environ- 
mental influence upon literary achievement. 

77. Pearson, K. ‘‘Critical notice: on the relation between 
home conditions and the intelligence of school children.’’ Bio- 
metrika, 15: 1923, 161-172. 


Pearson recomputes most of Isserlis’ data, getting results which do 
not agree perfectly with his. Isserlis is criticized severely for (1) not 
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telling how his etas and contingency coefficients were computed; (2) not 
knowing from the literature that Heron and Gilby had obtained similar 
results before; (3) drawing conclusions when numbers were too low 
and P.E.’s too high; (4) partialling out variables when r’s are not 
based upon the same cases; (5) concluding that “progressive improve- 
ment in home conditions may be expected to react favorably ‘not only 
on the health but also on the intelligence of school children.” He also 
criticizes Burt for claiming in the preface that home conditions and 
intelligence correlate between .3 and .4 when “all irrelevant factors have 
been eliminated.” Only C. A. was partialled out. (See also the sum- 
mary of the study of Isserlis, No. 71.) 


78. Pintner, R. The Mental Survey. New York, 1918, 116 pp. 


Groups of city school children averaged uniformly above the aver- 
age level of village and rural school children. 


79. Pressey, S. L., and Thomas, J. B. ‘‘A study of country 
children in (1) a good and (2) a poor farming district, by means of 
a group scale of intelligence.’’ Jour. Appl. Psych., 3:1919, 
283-286. 


In a study of 2800 city children and 538 country children, propor- 
tions varying from 21 to 33 percent of country children equalled or 
exceeded the median of city children. 


80. Root, W. T. ‘‘The intelligence quotient from two view- 
points.’? Jour. App Psych., 6: 1922, 267-275. 


The author tables tests and retests of six children taken from in- 
ferior homes and placed in “a very superior institutional home on the 
cottage plan.” Changes in I.Q. (after 12 to 19 months) ranged from 
— 9g to + 14, with an average of 5.5 points. 


81. *Schmitt, M. ‘‘Der Einfluss des Milieus und anderer 
Factoren auf das Intelligenzalter.’’ Fortschr. d. Psych., 5:1919, 
217-255. 

One hundred children, aged 4 to 16, in two Wirzburg orphanages 
were tested by a translation of Binet’s 1911 Scale. The M. A.’s of the 
older children were corrected for the inadequacy of the scale at the 
upper end. All the children had come from an originally poor home 
environment. 

When the subjects were divided into groups each covering a range 
of two years C.A., no significant difference was found in these groups 
between the proportion of mental retardation of those who had been in 
the institutions longest and of those who had been there shortest, or of 
those who had come from a poor moral environment and of those who 
had come from a better moral environment, or of those whose fathers 


~ 
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had been industrial workers and of those whose fathers had been 
peasants. A slight advantage was found in favor of those coming from 
the less abject material surroundings, but the author points out that this 
fact merely corroborates previous studies on economic status and proves 
nothing. 

There is no consistent tendency for children within the various age 
groups who have been in the institutions longest to show less mental 
variability than those who have been there for a shorter time. 


Comment: The author admits that the small numbers prevent re- 
sults from being conclusive, but believes that any “large influences” 
would nevertheless be revealed if present. 

Although the results appear to be negative with respect to environ- 
mental influence, there is the possibility that some type of selective influ- 
ence may have covered up an existing environmental influence. 


82. Strong, A. C. ‘‘Three hundred fifty white and colored 
children measured by the Binet-Simon Measuring Scale of Intelli- 
gence; a comparative study.’’ Ped. Seminary, 20:1913, 485-513. 

So far as the white children were concerned, less than six percent 
of the city school children were retarded, while 18 percent of mill- 
district children were over a year below the mental level of their age. 
None of the mill-district children was mentally accelerated, although 
10 per cent of city school children scored above average. 

83. Terman, L. M. The Stanford Revision and Extension of 
the Binet-Simon Scale. Baltimore, 1917, 179 pp. 

The I1.Q.’s of 492 children out of the original Stanford Binet 
standardization group correlated with social status (on a five-point 


scale) to the extent of .40. Median I.Q.’s for children from five social 
levels were: 


Social Group LO: 
PVISTYSITTEOTIOR Ts ialey steieisha art acl siete tenane eG 85 
IbalPSg Oe Ss Bra Bopboblamemdeaio ase bom a 93 
ISSUERS ey GES CeO yb Mee GCE ORO One 99.5 
SUPE OR Lwer an eo eziatels, ates acs hee alete.a ae 107 
iG Tay, ISU COS cael sts a « aid Hare love asaya cate ests 106 


84. *Terman, L. M. The Intelligence of School Children. 
Boston, 1919, 313 pp. 


Some individual cases are cited of (a) children showing good in- 
telligence in poor environment, and (b) children showing no improve- 
ment when placed in a good environment. A child of seven who had 
never been to school, though he had excellent home advantages, tested 


at 153. 
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Gypsy Mary, aged 16, stolen by gypsies when she was about four, 
had had no schooling. She tested at approximately 100, and in a year 
after being freed was ready for the high school. 

Two boys were adopted at ages five and seven from a very poor 
home. Their I.Q.’s were 73 and 72. Four years later, after exceptional 
advantages, their I.Q.’s were only 70 and 77. ) 


85. Terman, L. M., and Wagner, Dorothy. ‘‘Intelligence quo- 
tients of 68 children in a California orphanage.’’ Jour. of Delin- 
quency, 3: 1918, 115-121. 


Children, originally from poor homes, who had lived in an orphan- 
age anywhere from a few days to 11 years had a median I.Q. of 92.3. 
“The correlation between intelligence quotient and time in the institu- 
tion . . . was found to be slightly negative (—.16). This gives no 
support to the environment hypothesis.” 


Comment: Since there would probably tend to be a positive cor- 
relation between age and length of residence at the orphanage, and 
since the Stanford Binet I.Q. shows a slight negative correlation with 
age, the slight negative correlation of I.Q. with residence might perhaps 
be expected. 

The lack of environmental influence cannot be considered as thor- 
oughly established, since selective influence upon children of various 
ages might conceivably be sufficient to cover up the influence of en- 
vironment if it existed. 


86. *Theis, Sophie V. S. How Foster Children Turn Out. 
N. Y., 1924. (State Charities Aid Ass’n., Publ. No. 165.) 


Data were collected by 16 trained field workers upon 910 former 
wards of the State Charities Aid, 18 years old or over by 1922. Of 
these, 83 percent were under 25. ‘Thirty-five percent were under five 
when placed, 21 percent between five and 10, and 45 percent between 10 
and 19. It was possible to secure considerable information upon 797 
of the gio. 

For each of 797 subjects, information was secured on “capability 
to manage himself and his affairs with ordinary prudence.” Of the 
total group, 77 percent were ‘capable’ and 23 percent were ‘not capable.’ 

When 604 subjects who had had only one kind of home since place- 
ment (superior, good, or mediocre with respect to cultural and material 
status) were compared for the proportion ‘capable’ from each type of 
home, no home influence was manifest. Negative results were also 
found when the proportions of ‘A,’ ‘B,’ and ‘C’ subjects, in ability to take 
formal education, were tabulated against type of foster home. (‘A’ 
ability was defined as “capable of progressing beyond the eighth grade” 
or “having an intelligence quotient of 80 or more.”) 

Of 754 children who had had only one kind of care since place- 
ment, 12 percent had had excellent care, 68 percent average care, and 
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20 percent poor care (by the standards adopted for the study). Of those 
“who had excellent care, approximately 87 percent developed into 
‘capable’ subjects; of those who had average care 80 percent are 
‘capable,’ and of those who had poor care 66 percent are ‘capable.’ In 
other words, when excellent care was given, more of the subjects de- 
veloped into capable persons.” 

When the type of family background from which the subjects had 
originally come was tabulated, it was found that, in a group of 693 sub- 
jects, 435 had predominantly ‘bad’ family background; 65 had a mixed 
‘good-and-bad’ background; 44 had a ‘good’ background, and 149 an 
‘unknown’ background. The percentages of ‘capable’ subjects in these 
four classes were 71, 80, 83, and 81. This comparison seems to favor 
children with better background, but according to the author the dif- 
ferences here reported are not statistically reliable. Corresponding per- 
centages of “A” subjects in ability to take formal education were 63, 
67, 92, and 77. The differences in this case are regarded as significant. 

Of 271 subjects placed under the age of five whose ‘capability’ was 
known, 86 percent were capable. Only 73 percent of children placed 
at five or over were capable. Ninety percent placed under five years 
of age were rated ‘A’ on ability to take education as against 65 percent 
of those placed at five or over. 

Of 155 subjects one or both of whose parents had been mentally 
inferior, 68 percent were capable, and 53 percent were rated “A” on 
ability to take education. Of 82 subjects known to have had good abil- 
ity in their parentage, 80 percent were capable, and 72 percent were 
pated f\.0 


Comment: No unjustifiable conclusions appear to be drawn from the 
data, which, as the author admits, lack precision. Upon the statistical, 
as well as upon the clinical, data, the author bases the statement that 
“undoubtedly the child’s adjustment to his foster family governs to a 
significant degree his adjustment to society, and his adjustment to his 
foster family has less to do with their standards of comfort and their 
place in the community than with their human qualities and their under- 
standing.” ‘The results as regards ‘capability’ and ‘A’ ratings seem to 
favor heredity somewhat more than environment, but, as the intro- 
duction by Homer Folks points out, “We cannot disentangle the factor 
of inheritance from that of early life with the children’s parents and 
the environment provided by them.” 


87. *Thomson, G. H. ‘‘The Northumberland Mental Tests.’’ 
Brit. Jour. Psych., 12: 1921, 201-222. 

The pertinent part of the study concerns 64 pupils in small schools 
in the Cheviot valleys (very remote), two schools in a poor suburb of 


Newcastle involving 64 pupils, and 111 pupils in two schools in a well- 
to-do suburb of Newcastle. The Northumberland Mental Tests (whose 


282 THE TWENTY-SEVENTH YEARBOOK 


norms were based upon 2532 cases) were given chiefly for the purpose of 
discovering scholarship candidates in rural districts. 


In trials with 300 cases, children from a “very rural district” tested 
about a year behind children “from a large city” (the age of these pupils 
is not stated). But in three schools in the Cheviot valleys (where little 
drainage of descendants of Border troopers to the cities had taken 
place), 80 percent of 25 pupils tested above 100 I.Q. in one school, 70 
percent of 17 in another, and 50 percent of 22 in another. ‘These per- 
cents were higher than in suburban regions. In a*“‘poor suburb,” per- 
cents above 100 I.Q. were 29 and 35; in a well-to-do suburb, 55 and 56. 
The study corroborates Miss Bickersteth’s results—i.e., ability is high- 
est close to and far away from cities. 


Comment: These results suggest that the sectional differences 
found so universally upon intelligence tests may be due much more to 
selection and much less to environment than the advocates of environ- 
ment believe. 


88. Yerkes, R., and Anderson, Helen. ‘‘The importance of 
social status as indicated by the results of the point-scale method of 
measuring mental capacity.’’ Jour. Hduc. Psych., 6: 1915, 137-150. 

Kindergarten and first-grade children in a well located and in a 
poorly located school were tested. Pupils in the well located school 
averaged much the higher, except in the four-year-old group. Four- 


year-olds in a poorly located school possibly averaged higher because 
they were less timid. 


3. Race Differences 


a. Differences Between Racial Groups. These differences are 
important to know, but almost no studies of this type provide any 
means for evaluating the contributions ‘of nature and nurture. The 
environment of low-testing races or nationalities is generally in- 
ferior to that of high-testing ones, but we do not know whether this 
environmental difference is a cause or an effect. Until we can 
have controlled experiments upon children of various races trans- 
planted at infancy into uniform environments, precise knowledge 
regarding native racial differences may be impossible to secure. 


89. Arlitt, A. H. ‘‘On the need of caution in establishing race 
norms.’’ Jour. Appl. Psych., 5: 1921, 179-183. 
Because differences on the Stanford Binet between children of high _ 


and of low economic status were greater than differences between aver- 
age levels of American whites, Italians, and Negroes, the author urges 
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that valid racial comparisons can be made only between groups of equal 
economic status. The median I.Q. of whites was 106; of Italians, 85; 
and of Negroes, 83. 


Comment: ‘The author fails to consider that the different intelli- 
gence levels for groups of varying economic status may themselves be 
congenital, t.e., that individuals of certain mental levels may be drawn 
to certain occupations, rather than that economic status conditions in- 
telligence. To make racial comparisons only between groups of equal 
economic status would introduce inextricable factors of selection. 


90. Boas, F. The Mind of Primitive Man. New York, 1922, 
294 pp. 


Chapter I of this book has been summarized in order to illustrate 
the point of view of one school of thinkers with respect to mental dif- 
ferences in various races. Boas protests against the frequent, but “un- 
proved” assumption that the mental status of a people is measured by 
the difference between its social status and our own. He contends, in 
the first place, that the fact that certain racial groups had developed a 
few thousand years ago civilizations superior to that of our own an- 
cestors is not evidence that our race was then an inferior one. No more 
is the white superiority over the achievements of many of the so-called 
‘primitive races’ of this era evidence of a higher endowment of the white 
race. The dissemination of culture is a slow and partly accidental 
process, and the arrival of one race at a certain point of culture a thou- 
sand or two years earlier than another race may be a matter of chance. 


Since real contributions to civilization are made by relatively few 
members of a race, and since a great share of civilized growth takes 
place through the spread of culture from other groups, the problem 
resolves into one of assimilation of culture. Here, obviously, many 
‘primitive’ races are weak. ‘The author, citing a few examples of the 
following conditions, but without proving that they are actually re- 
sponsible for the inferiority of the races concerned, concludes that the 
question of racial differences is still unsettled. The conditions that may 
account for the slow progress of some races are: (1) the difference 
in appearance of primitive and civilized races, and consequent racial 
prejudices; (2) the ravages of diseases, which have usually been intro- 
duced to primitive people by white settlers; (3) the fact that hand-made 
products cannot compete with manufactured ones brought in with white 
settlers in great abundance, so that the progress of ‘home industry’ is 
blocked; and (4) the circumstance that the primitive people are some- 
times crowded out of their accustomed haunts by sheer force of numbers. 


Comment: While the study has little that could be called evidence 
either for or against innate racial difference, it presents speculations 
of an important nature. 
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91. Brigham, C. C. A Study of American Intelligence. Prince- 
ton, 1923, 210 pp. 


This is a further analysis of data on the American army intelligence 
tests. The chief contribution, other than recapitulation of results re- 
ported in Volume 15, National Academy of Science Memoirs, is the 
tabulation of test scores by the estimated proportions of Nordic, Alpine, 
and Mediterranean racial strains in men from the European countries. 
Weighting the score distributions of the foreign countries in propor- 
tion to the estimated number of each racial strain comprising the for- 
eign country in question, an “approximate sampling” was obtained to 
represent the level of the three races. These scores’, with the S. D.’s 
of the distributions were: 


Nordic Alpine ae en 
IVICA crestint cieteae oleteiare al ouvoteiateoistatcs ehete ee ears eta 13.28 11.67 11.43 
Pops) D RINE Aol Sha aaah Te Re a cee eae a Un, PRE 2.70 2.87 2.70 


IN Soi ehetcsrae ll stars cys icin te atte amare ap raha 3456 4766 4196 


When non-English speaking Nordics were used in similar compari- 
sons, differences persisted, but became smaller. 

The average scores of men who had been in this country for vary- 
ing lengths of time were: 
0-5 yrs. 6-10 yrs. II-I5 yrs. 16-20 yrs. Over 20 yrs. 

11.41 11.74 12.47 13.55 13.82 

These differences probably could not be entirely racial ones, be- 
cause the period during which the greatest drop in proportion of Nordic 
immigration to this country occurred does not correspond to an especially 
marked change in average score for the above classified groups. Differ- 
ences are perhaps accounted for in part by increasingly inferior immi- 
grants and in part by the effect of Americanization over a period of 
years. 


92. Brown, G. L. ‘‘Intelligence as related to nationality.’’ 
Jour. Educ. Research, 5: 1922, 324-327. 

Stanford Binet tests given to 913 school children yielded median 
1.Q.’s that ranged from 104 for Norwegians, 102 for Swedes, Germans, 
and English, down to 86 for Slovaks, and 78 for Italians. 

93. Clark, W. W. ‘‘Birth rate and native intelligence.’’ 
Psych. Clinic, 14: 1922, 111-115. 


Los Angeles negro children in five schools did practically as well 
on the National Intelligence Test and a battery of achievement. tests 


?The scores, whether on Alpha, Beta, or Stanford Binet, were transmuted 
into comparable units. 
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as did white children. The children were from 15 representative 
Los Angeles schools. 


Comment: Selection probably accounts for this result, since no 
other studies have yielded equivalent performance for negro and white 
groups. 


94. Darsie, M. The Mental Capacity of American-Born Jap- 
anese Children. Baltimore, 1926, 89 pp. (Comp. Psych. Mono- 
graph, 3, No. 15.) 


Five hundred seventy urban Japanese children (representing prac- 
tically all children from 10 to 13 in certain communities) tested from 6 
to 12 points below white children of those ages on the Stanford Binet, 
but on the Army Beta Test the Japanese were equal to the white at 
age 10 and improved, relative to the whites, until 75 percent exceeded 
the white median at age 12. 

The author suggests that the discrepancy on Beta at different ages 
may be explained by the fact that Beta is not difficult enough to dis- 
tribute ability in American children after age 11; but in the case of 
the Japanese, 82 percent of their superiority after age 12 is due to per- 
formance on the digit-symbol and number-comparison tests, which pro- 
vided ample range. The author thinks that this may represent an “in- 
dustry” factor, or factor of “sustained attention.” 

On the Stanford Achievement Test the averages of total Japanese 
scores from age 10 to 13 were approximately at the white norm, but 
on reading, language, and science, the Japanese at age 13 fell behind 
the norm almost a year. 

Seven judges were asked to rate Binet items for their dependency 
upon linguistic skill. The rank-order correlation composite judgments 
of linguistic dependency and the degree of superiority of white over 
Japanese on specific items was .62 or .65, depending on how the superi- 
ority was measured. The author concludes: (a) “There is a very 
definite tendency for the Japanese to equal or exceed American norms 
as they pass from the linguistic to the non-linguistic tests.” (b) “It is 
highly probable that the innate mental capacity of Japanese children is 
greater than the Binet I.Q. would indicate. How much greater the 
writer is unable to estimate.” (c) “Japanese superiority in the enclosed 
boxes, Binet paper cutting, and code tests is established beyond ques- 
tion.” These, like the digit-symbol and number-comparison tests of 
Beta, seem to require acuity of visual perception and sustained attention. 


95. Fukuda, T. ‘‘Some data on the intelligence of Japanese 
children.’?’ Amer. Jour. Psych., 34: 1923, 599-601. 

Forty-three Japanese children in Denver had an average I.Q. of 97. 

96. Garth, T. R., and Whatley, C. A. ‘‘The intelligence of 
Southern negro children.’’ School and Soc., 22: 1925, 501-504. 
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The average I.Q. on the National Intelligence Test of 1300 negro 
children in southern cities was 75. 


97. Goodenough, Florence L. ‘‘Racial differences in the in- 
telligence of school children.’’ Jour. Exper. Psych., 9: 1926, 
388-397, 


The subjects were “2457 public-school children (in Grades I to IV) 
practically all of whom were American born, but in whose immediate 
ancestry a number of racial stocks are represented. Children of mixed 
parentage have not been included, except in the case of the Negroes 

. and the Indians.” 

The Goodenough Intelligence Test for Young Children (drawing a 
man) was applied. Means, medians, S.D.’s and coefficients of vari- 
ability were computed for 16 groups. 

Mean I.Q.’s varied from 106.1 (Jewish); 104.1 (Chinese); 103.5 
(Scandinavian); 101.9 (Japanese); 101.5 (American); down to 89.1 
(Italian); 88.5 (Spanish-Mexican); 85.8 (Calif. Negroes); 85.6 
(Hoopa Valley Indians); 78.7 (Southern Negroes). The results from 
this non-verbal test thus seem to agree with other results from verbal 
tests. 

This article includes an excellent tabular summary of representative 
studies on test scores of school children as conditioned by race or nation- 
ality. The summary lays no claim to being complete, however, and 
omits a number of important groups, such as Russian and Japanese. 
The summary is reproduced on p. 287 because it presents many data in 
compact form. ? 


98. Kubo, Y. ‘‘The revised and extended Binet-Simon tests ap- 
plied to the Japanese children.’’ Ped. Seminary, 19 :1921, 187-194. 

An adaptation and translation of the Binet Scale given to 1200 Japa- 
nese children of ages two to 15 in Tokyo yielded average 1.Q.’s of 98 
and 99 at various ages. 

99. Mayo, M. J. The Mental Capacity of the American Negro. 
New York, 1913, 70 pp. (Archives of Psych., No. 28.) 

Of 150 negro high-school pupils under economic conditions similar 
to those of 150 white high-school pupils, 29 percent of negroes exceeded 
the white median of high-school marks. 

100. Murdoch, Katherine. ‘‘A study of race differences in 
New York City.’’ School and Soc., 2: 1920, 147-150. 

Hebrew, American Negro, and Italian children of approximately 
the same economic status and all known to have no language handicap 
were given the Pressey group tests. The median of the Hebrews -was. 


surpassed by 57.7 percent of Americans, 30.3 percent of Negroes, and 
15.5 percent of Italians. 
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INTELLECTUAL ABILITY OF AMERICAN ScHOOL CHILDREN OF VARIOUS 
Ractau Srocks (After Goodenough) 
SSS eo 


Racial Stock Reported by 


White American.......] Pintner and 


Keller...... 


White American....... Sheldon...... 
White American....... Dickson... 
American Negro (Ohio) } Pintner and 
Keller...... 
American Negro....... Peterson...... 
(Tennessee) 
American Negro....... JOLGAN . cetsue 
(Arkansas) 
American Negro..,....] Thorndike.... 
(Northern) 
American Negro....... SUNNE.. cose 
English: creteisieuslshevers cel Pintner and 
Keller...... 
English=ec.cfeciteeccie ol LOW D nets, « 
Ttaliab i. cies sisis sloytiens Pintner and 
Keller...... 
PTAA ss orapetotoreisie oe nt Dickson...... 
Taliant. aes sivietensterae Younge=.. 1a; 
JDOTE MBG Setinnpaaoode Brown. «+.-.. 
German....... fodbenat Pintner and 
Keller.... 
German setae tjeias)siee rie Browdlesecie de 
Jewish..... Sondocr: Pintner and 
Keller...... 
DOWIAN ae nies oicvele sleinislsinie Murdock..... 
Chinese (San Francisco)} Yeung....... 
Chinese (Hawaii)...... Symonds..... 
Spanish-Mexican.......| Sheldon...... 
Spanish-Mexican....... Dickson 2.1 
Portuguese............| Young (Quot- 
ing unpub. 
data by Roll) 
Portuguese............| Dickson...... 
Norwegian............| Brown 5 
Swedish). cis orcs aek/e/< 2 SLOWD s)o010,6 <2 
Austrians sci reels cieio)|\ STOWD. osler60 « 
French? . 642s: 68 ssiie!| C DTOWD ah-).016'- 
[iE mo Sodnonocon ool. BHM oa car] 


Slavish.......+.+e.-«+-| Eintner and 
Keller...... 
Hungarian............| Pintner and 
Keller...... 
Indian (Michigan)..... FROWOmelelse lees 
Am Diam arolete iste ale lalele’s eleleie | ELULICOT: sieleveralore 


Number of 
Cases 


249 


49 
71 


All children 8- 
10 yrs. in 
several 
schools. .... 

247 


25 


Test Used Result in I.Q. 
(Unless otherwise 
stated) 
Stanford-Binet 95 
Sab. 104 
FS}, 184, 106 
Anti 32% 88 
Pressey...... 75 
Nig De Dasts. rete 14 yr. negroes equal 
10 yr. whites 
Dae. Ca. 4% negroes pass- 
ed median score for 
whites ofsame grade 
N.1,T. and Negro ave. 1-1} yrs. 
Myers...... M.A. below whites 
SEB eer kriet ass 97 
Si DB gecsyeistete eters 101.8 
uD Sears oelsccre 3X 84 
SMB nemcnscetherntese 84 
Army Alpha About 83 
and Beta... 
SUBS Pe areccuvevstels 77.5 
PH ESS aNooD ar 91 
S.Bi vocal et 102.3 
S:Brasteaeces 95 
Pressey....... Jews approx. equal to 
white Americans 
SB as. acaciy 97 
Pintner non- 99 
language... 
S.B. and Cole- 89 
Vincent.... 
Si Biskiepiie cine 78 
Bi Bice sence 86 
See iaeitieeree 84 
BBs a. iesetesiets 103.8 
Be rsete cave 101.9 
SeBeweiecas 99.5 
Sey lststscetela 95.4 
BB igiasretscheperecs 90 
SB eerie Sesayeiets 86 
SUB. ctecaereet 89 


Goddard-Binet} Only 5.8% of Indians 
tested at age or 
above, 

Whites excel Indians 

by 1.6 P.E. of latter. 

Correlation bet.degree 
of white blood and 
score = .61 + .017. 


Otis. oo... 
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101. Peterson, Joseph. The Comparative Abilities of White 
and Negro Children. Baltimore, 1923, 141 pp. (Comp. Psych. 
Monograph, 1: No. 5.) 


Three hundred fourteen negro children, aged 7 to 10, were about 
1.5 P. E. units inferior to white children, both upon Peterson’s Rational 
Learning Test and upon elementary, verbal group tests. Two hundred 
eighty-four of them, roughly divided into five classes on the basis of 
color, yielded a mean square contingency coefficient between color and 
Rational Learning score of .50. 


Comment: It is significant to know that differences exist even on 
tests which would appear to be little influenced by schooling, although 
this study, like most other studies of racial difference, does not separate 
the influence of heredity from that of environment in any crucial way. 


102. Pyle, W. H. ‘‘A study of the mental and physical char- 
acteristics of the Chinese.’’ School and Soc., 8:1918, 264-269. 
Children in China tested on a battery of five mental tests show a 


superiority to American children on rote memory, but an inferiority to 
Americans on logical memory, substitution, analogies, and spot pattern. 


103. Schwegler, R. A., and Winn, Edith. ‘‘Comparative study 
of intelligence of white and colored children.’’ Jour. Educ. Re- 
search, 2: 1920, 838-848. 

Fifty-eight white boys and girls in a Kansas Junior High School 
tested 10 points higher on the Stanford Binet than 58 negro boys and 
girls. 


Comment: The negro children were probably somewhat selected, 
owing to attendance at junior high school. Yet even so, a significant 
racial difference was present. 


104. Seago, D. W., and Koldin, T. S. ‘‘A comparative study 
of the mental capacity of sixth-grade Jewish and Italian children.’’ 
School and Soc., 22: 1925, 564-568. 


Jewish boys test considerably higher than Italian boys on the 
National Intelligence Test. 


105. Strong, A. C. ‘‘Three hundred fifty white and colored 
children measured by the Binet-Simon Measuring Scale of In- 
telligence.’’ Ped. Seminary, 20: 1918, 485-512. 


Of children tested in Columbia, S. C., schools, 60.8 percent colored 
and 25.2 white were below the norm, while 9.2 percent colored and 
26 percent white were above the norm. (Cf. abstract of same article, 


No. 81.) 
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106. Yeung, K. T. ‘‘Intelligence of Chinese children in San 
Francisco.’’ Jour. Appl. Psych., 5: 1921, 267-274. 


One hundred nine Chinese children in San Francisco had a median 
I.Q. of 97 on the Stanford Binet. Tests were translated into Chinese 
when marked language handicap was present. 


107. Yerkes, R. M. (Editor) Psychological Examining in the 
United States Army. National Academy of Science Memoirs, 15: 
1921, 890 pp. 


Chapter 6: “Relation of intelligence ratings to nativity.” 

Some 13,000 out of 94,000 of the white draft of the principal 
sampling were foreign born. Only countries from which 100 or more 
cases were reported are considered. 

The rank order of countries on the basis of men obtaining A or B 
ratings (apparently on whatever test they took—Alpha, Beta, Binet, or 
Performance) is that shown in the accompanying table. 


Country Percent Country Percent 
Englands sneer ce 119) 9/ Hrelandipee ssn cae 4.1 
Scovlandarenr see emit 13.0 AL FOnCIST ele weenie 4.0 
iWiniterdratteys este. 12.1 Turkey. i.e see eiatie ae 3.4 
Foland #2e5 nace cee: 10.7 IAVISET IAS cette, socio! feuscaretsrs 3.4 
Canada met mrest ccc eee 10.5 Russia teas siete een Dadl 
Germany; of cess oalon = 8.3 Greecene ct sas e aoisieas 2.1 
Denmarlewa.ton teehee 5.4 Ttallyaien aie steinacke spats 0.8 
DWEGONES Westen reuse: 4.3 iBelomuma neice eters 0.8 
INOL WAY 2 tecestohecceieke ties « 4.1 Poland 2eee ceteris ee teins 0.5 


Tables are also given of total Alpha and Beta distributions, total 
letter-grade distributions, etc. 

Chapter 11: “Intelligence of the draft.” 

In a sampling from all states, 19,000 negroes had a median M. A. 
of 10.1. 

Chapter 10: “Statistics on education.” Three thousand negroes 
had a median Alpha score of 38.6, and 2000 southern negroes a median 
of 12.4. These values are contrasted with scores for 13,000 officers, 
139.2; 52,000 native-born whites, 58.9; and 4000 foreign-born whites, 
46.7. 

108. Young, K. ‘‘Mental differences in certain immigrant 
groups.’’ University of Oregon Publications, 1: 1922, No. 11. 
Also Scient. Mo., 15: 1922, 417-434. 


Italian, Portuguese, and Mexican I2-year-olds from California 
tested in the neighborhood of one S. D. below American 12-year-olds on 
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both Alpha and Beta test. (See also data from Yeung in the tabled 
summary of Goodenough in this section.) 


b. Differences Between Groups of Mixed Blood. In so far as 
the environment of individuals of mixed blood is similar to that 
of full-blooded individuals, evidence of this kind is important. 
But the degree to which environment is similar is extremely 
problematic. 


109. Ferguson, G. O., Jr. The Psychology of the Negro. New 
York, 1916, 1388 pp. (Archives of Psych., 5, No. 36.) 


Three hundred nineteen negro pupils in Richmond were divided 
on the basis of skin color into full, three-fourths, one-half, and one- 
fourth blood mixtures. As the author states, whatever inaccuracy enters 
into the classification would restrict, rather than emphasize, differences. 
On three tests the average percent of negroes aged 12 to 17 reaching 
or exceeding the white norms was that shown here in tabular form: 


Full and One-Half aid 
Three-Fourths One-Fourth 


Test I, Mixed relations 


OV Bis cre: Httae Ehints oe RRS goes 17.0 88.5 
Girls ee ia hate Mere ein BAe cle oteieed ee 25.5 41.3 
Test IJ, Mixed relations 
One ou nthe sicrokc mnie cetera tae 19.2 46.8 
Girlstee a sous Sabrinectcits oes PX AP 42.8 
Completion Test 
LS LON CRA Gland GRO BAe OInecL aor OAT Oe 19.0 44.8 
Ganley Wectte a arene tra merenetecs See cone 23),2 31.5 


The author points out that these comparisons on the basis of amount 
of negro blood would be even more striking if selection in the schools 
did not weed out negroes so rapidly. He thinks the results are strong 
evidence for heredity, since the social limitations of mixed-blood negroes 
are about the same as those of full-bloods. 

Comment: ‘Though possible differences in social status are not 
altogether ruled out, the results are most suggestive. 


110. Garth, Thomas R. ‘‘A comparison of the intelligence of 
Mexican and mixed- and full-blood Indian children.’’ Psych. Rev., 
30: 1923, 388-401. 


The National Intelligence Test was administered to 307 Mexicans, 
126 mixed-blood Indians, 176 Plains Indians and South-East full-bloods, 
249 Pueblo full-bloods, and 85 Navajo and Apache full-bloods. The In- 
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dians were tested in Indian schools at Chilocco and at Albuquerque. The 
Mexicans were tested in San Antonio schools. 

Groups were subdivided by ages from 12 to 19. In each sub-group 
the order of merit was: mixed-bloods, Mexicans, Plains and South- 
East, Pueblo, Navajo and Apache. The mixed-bloods had 80 percent 
exceeding the median of the Plains and South-east at ages 12 and 13, 
and 60 percent at ages 18 and 19. Estimates of social status indicated 
the same sequence as the foregoing, and the average amount of educa- 
tion of the blood groups followed the same sequence, except that the 
Pueblo Indians had slightly more education than the Plains and South- 
East Indians. Because of the differences just mentioned, the factor of 
environment cannot be untangled. 


111. Hunter, W. S., and Sommermier, Eloise. ‘‘The relation 
of the degree of Indian blood to score on the Otis Intelligence 
Test.”’ Jour. Comp. Psych., 2: 1922, 257-277. 


Seven hundred eleven Indian pupils (chiefly between 14 and 20) 
were divided into four classes on the basis of degree of Indian blood. 
The correlation between degree of Indian blood and Otis Group Test 
scores was .35 = .02. This became .41 when age and months of school- 
ing were partialled out. Schooling and Otis correlated .35 for the entire 
group, and the correlation varied only between about .30 and .40 when 
“degree of blood” groups were taken separately, from which the conclu- 
sion was drawn that schooling is not a prominent factor in the correla- 
tion between degree of blood and Otis. 

The authors admit their material is limited by the fact that no data 
are available on the relative social status of the mixed groups. Also, 
the possibility of temperament accounting for differences is not ruled 
out, though observation of “the manner in which the students worked 
upon the tests,” and the fact that all were accustomed to school work, 
lead the authors to infer a racial difference. 


112. Peterson, Joseph. The Comparative Abilities of White 
and Negro Children. Baltimore, 1923, 141 pp. (Comp. Psych. 
Monograph, 1: No. 5.) 

See summary in Part a of the section on race differences, No. 101. 

113. Reuter, E. B. ‘‘The superiority of the mulatto.’’ Amer. 
Jour. Sociol., 23: 1917-1918, 83-106. 


The author argues upon evidence that while most outstanding 
negroes are individuals of mixed blood, it is not known to what degree 
this fact is due to racial superiority of the white race, and to what de- 
gree to the “social superiority” of the mulattoes operating “to bring 
into the group the favorable variant types produced by the entire race.” 


ce. Attempts to Evaluate the Effects of Foreign Language 
Handicap. Such attempts are usually made by comparing the 
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results of verbal and non-verbal tests; but, as is pointed out in 
Chapter II on ‘‘Statistical Hazards,’’ verbal and non-verbal tests 
are not comparable. Correlations between length of time English 
has been spoken and performance on verbal tests provide the most 
useful approach. 


114. *Bere, May. A Comparative Study of the Mental Capac- 
ity of Children of Foreign Parentage. New York, 1924, 105 pp. 
(Teachers College, Columbia University, Contrib. to Ed., No. 
154.) 

In groups of Hebrew, Bohemian, and Italian children, each com- 


prising 100 ten-year-old boys, the following results were secured with a 
verbal and a non-verbal test: 


Stanford Binet Stanford Binet | Pintner-Paterson 
1) M.A. M.A. 


Mean S.D. Mean S.D. Mean S.D. 


Mtaltamiiatae ee teytoxsns avers 85 10 108 12 109 21 
‘Bohemianw ceases 93 10 118 15 123 16 
TORE Wiavta cierto 98 13 123 16 102 20 


The P.E. of the difference in M.A.’s between the two tests for each 
nationality or between any two nationalities on either test is about one 
month. 

To test out the theory that the diminution of differences between 
the three groups on the Pintner-Paterson is due to elimination of lan- 
guage handicap, comparison was made of the average length of resi- 
dence in this country of parents of total groups and of parents of chil- 
dren having higher rating on the Pintner-Paterson than on the Binet. 
Thus: 

Italian, 19.47 years versus 19.3 for total group; 

Bohemian, 19.2I years versus 19.7 for total group; 

Hebrew, 18.53 years versus 17.56 for total group. 


It was found also that among the children having a higher rating 
on the Pintner-Paterson, foreign language exclusively was used in the 
homes of: 

23 percent of the Italians versus 23 percent of all the Italians; 

19 percent of the Bohemians versus 29 percent of all the Bohemians; 

o percent of the Hebrews versus 3 percent of all the Hebrews. 


The author concludes that: ‘The zero correlation between Stan- 
ford Binet and Pintner-Paterson Performance Test results and aver- 
age length of residence, plus the rise in Stanford Binet and Perform- 
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ance mental ages with increased use of English in the home . . . do not 
support the argument that a language handicap is responsible for low 
I.Q.’s on the Stanford Binet tests.” 

The differences between results on verbal and non-verbal tests “sug- 
gest . . . . that these groups may differ in the nature, as well as in the 
amount, of their abilities.” 


115. Colvin, S. S., and Allen, R. D. ‘‘Mental tests and lin- 
guistic ability.’’ Jour. Educ. Psych., 14: 1923, 1-20. 


Fifty children of American parentage and 50 of Italian parentage 
in the Providence schools were tested on the Stanford Binet, the Na- 
tional Intelligence Test, and the Lippincott-Chapman Classroom Prod- 
ucts Survey Tests. They ranged in age from 11 to 16, with an average 
C.A. of 13.1 in each case. All were in Grades V to VIII. How they 
were selected is not stated. The results are shown in the table. 


American Italian 
SINE Gale) Seaeeee Merton ue ne ct elettertae a oketis G.o0e el anesenahs 92 91 
INTE 1 C) Soper aby aerate vase scetis cistiow rarcelt ot aoeae & cist? 85 76 
Pedag. age (presumably on Lippincott-Chapman). . 11.8 12.0 


Since the Americans and Italians had equal standing on the Binet 
and on Achievement, the authors attribute the 9 points difference on 
N.LT. to language handicap. Binet and N.I.T. correlated .73 for the 
Americans and .79 for the Italians. 

Comment: The conclusion seems reasonable, though not definitely 
proved, because Binet and N.I.T. measure abilities which do not en- 
tirely overlap (correlating with each other only about .70 to .80 in single 
school grades). 


116. Feingold, G. A. ‘‘Intelligence of the first generation of 
immigrant groups.’’ Jour. Educ. Psych., 15: 1924, 65-82. 


American children of foreign parentage in a Hartford high school 
were tested by Army Alpha. The rank order of average scores for the 
different nationalities was almost identical with that of the army results. 
Considering here only the results from the freshman class, the largest 
difference between groups in mental age was 11 months (between Brit- 
ish and Polish, and British and Italian). This, compared with the mental 
age difference of two years between native and foreign-born draftees, 
suggests that some of the army differences may have been due to lan- 
guage handicap. 

Comment: 'The comparative uniformity of the high-school pupils 
may have been due to selection exercised by attendance at the high school. 
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117. *Madsen, I. N. ‘‘Some results with the Stanford Revision 
of the Binet-Simon Tests.’’ School and Soc., 19: 1924, 559-562. 


Sixteen Italian children thought by their teachers to have language 
handicaps were tested on entering the first grade and retested a year 
and a half later. The mean I.Q. on the first test was 79.3, and on the 
second test, 80.1. The correlation between test and retest was .97 + .oI. 
No evidence appears of any influence of language handicap. 


118. Pintner, R. ‘‘Comparison of American and foreign chil- 
dren on intelligence tests.’’ Jour. Educ. Psych., 14: 1923, 292-295. 


American third- and fourth-grade pupils show less superiority to 
foreign pupils on the Pintner Non-Language Test than on the National 
Intelligence Tests. 

Comment: See comment on Pintner and Keller article, No. 120. 
Ambiguity also lies in the fact that M.A. comparisons are based on chil- 
dren by grades rather than on children of known ages. 


119. Pintner, R. ‘‘Results obtained with the non-language 
group test.”? Jour. Educ. Psych., 15: 1924, 473-483. 


Dividing 1361 twelve-year-olds into those with Anglo-Saxon, Ital- 
ian, German or Dutch, and miscellaneous names, it was found that the 
only two groups showing a significant difference on the Pintner Non- 
Language Test were the Anglo-Saxon and Italian. The author con- 
cludes that it is significant that differences obtain between Italians and 
Anglo-Saxons on a non-verbal test, even though they are not so large 
as differences on a verbal test. The difference was about .25 of the 
S.D. of Anglo-Saxons. 


Comment: ‘The result is significant, but helps little in the inter- 
pretation of the far larger differences found on verbal tests. 


120. Pintner, R., and Keller, R. ‘‘Intelligence tests of foreign 
children.’’ Jour. Educ. Psych., 13: 1922, 214-222. 


Second-grade children tested upon the Youngstown Children’s 
Bureau Binet Revision and the Pintner Non-Language Test were divided 
into 49 of English-speaking parentage and 56 of foreign-speaking 
parentage. 

Seventy-three percent of the English-speaking, and 82 percent of 
the foreign-speaking were higher on the Pintner than on the Binet. 

Using the median mental age on at least three performance tests 
for each child, 52 percent of 45 English-speaking children, and 75 per- 
cent of 95 foreign-speaking children were better on the performance 
tests than on the Binet. 

The average I.Q. of 367 English-speaking children was 92, and-of ~ 
674 children from foreign homes, 84. 
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The authors conclude: “Children who hear a foreign language at 
home test lower as a rule when given the revisions of the Binet Test 
than when given tests which require a minimum knowledge of English. 
. . . Those children who hear a foreign language in their homes may 
suffer a serious handicap when tested only by the revisions of the Binet 
Test.” (See also data from Pintner and Keller in the tabled summary 
by Goodenough, No. 97.) 

Comment: ‘The fact that the non-language and the performance 
tests are measuring different abilities from those sampled by the Binet 
Test makes the interpretation ambiguous. 


121. Saer, D. J. ‘‘An inquiry into the effect of bilingualism 
upon the intelligence of young children.’’ Jour. Exper. Ped., 6: 
1922, 232-240. 


The phase of the experiment of especial interest to our problem 
concerned boys of 7 to 10 at Aberystwyth, (selected from 260 boys of 
7 to 14). The exact number is not stated for this portion of the 
investigation. 

The Stanford Binet, plus non-overlapping questions in Binet’s 1911 
scale and in Burt’s version, was given the subjects. The tests were 
carefully translated into Welsh, and two years later given to boys in 
the group who were bilingual, i.e., boys who spoke English fluently, but 
whose “mother-tongue” was Welsh. To those who were monoglot, i.e., 
whose mother-tongue was English, the same English tests were repeated 
in two years. The former group showed a gain in M.A. two and two- 
thirds months greater than that of the second group. 

Comment: This section seems inadequately reported. Numbers are 
not given, nor the method of computing I.Q. However, the excess gain 
seems significant, since elsewhere the initial I.Q.’s of the two groups 
are seen to have the same medians—100 and 99. 


122. Sunne, D. ‘‘Comparison of white and negro children in 
verbal and non-verbal tests.’’ School and Soc., 19: 1924, 469-472. 

Negro children did slightly less poorly upon the Myers Mental Test 
than upon the National. The author concludes that “it is difficult to 
determine how much racial differences and how much differences in 
school training contribute to the divergence in test results.” 

123. Walters, F. C. ‘‘Language handicap and the Stanford 
Revision of the Binet-Simon tests.’’ Jour. Educ. Psych., 15: 1924, 
276-284. 

One hundred and sixty-five children in a New York school of a 
foreign district with an average I.Q. of 96 and average C.A. of 12-8 


were divided into those hearing chiefly foreign language and those hear- 
ing chiefly English at home. The Stanford-Binet items on age levels 
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from 10 to 16 were divided into language and non-language tests, and 
1.Q.’s computed for the children with and without inclusion of the 
“language” tests. 

Comparing I1.Q.’s by the two methods, the author concludes that 
there is a language handicap of from 6 to 8 months of mental age for 
children of 13 coming from foreign-language-speaking homes. 

Comment: ‘The most valid and most reliable of the Binet test ele- 
ments are the verbal ones. Hence a convergence of the groups on the 
“non-language” tests might easily occur, even if there were no language 
handicap among the children. Thus, the data of this study provide 
nothing crucial. 


4, Intelligence and Schooling 


Simple correlations between amount of schooling and intelli- 
gence of course contribute nothing crucial, since such relationships 
might or might not be due to selection. It is only when differences 
in intelligence appear among comparable individuals for whom the 
amount of schooling is the only primary difference that definite 
conclusions can be drawn. 


124. Bagley, Wm. C. Determinism in Education. Baltimore, 
1925, 194 pp. 


Chapter IV. Do good schools pay? Inter-correlations are reported 
between the following data for the states: 

1. Sandiford’s 1923 distribution of states by percent of A and B 
men in Army Alpha. 

2. 1920 census report on literacy. 

3. Reeder’s 1923 rankings of percent capita circulation of I0 
magazines. 

4. Proportion of population in Who’s Who 1850-1880. 

5. Knauth’s distribution of per capita income, 1922. 

Using the 26 states having over 55 percent native population, these 
r’s range from .73 to .Ol. 

The correlations between Army Alpha test rank and Ayres School 
Index (the latter computed for various decades) are also reported as: 
1880, .83; 1890, .82; 1900, .89; 1910, .87; 1918, .83. 

The author points out that Alpha ranking correlates highest with 
school ranking in the decades that the r’s should be highest (i.e., when 
the army draftees were in school). “Recent school conditions,” he says, 
“have not affected the present dominant generations, but school condi- 
tions 20, 30, and 40 years ago did have a profound effect if our figures 
tell the truth.” 

It is also shown that the average correlation of four of the above 
measures with one another, “when the influence of school opportunity 
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is cancelled,” is only —.o9. ‘The author ascribes this decrease to the 
“predominantly causal influence” of school opportunity. 


Comment: It must be kept in mind that the high inter-correlations 
reported are based upon group averages and hence do not represent the 
true relation between the variables in question. Also, the average of 
the partial correlations with School Index constant is a highly ambigu- 
ous measure, because it is not known whether a cause or an effect is 
being partialled out. It seems fair to say that no data are reported 
here which could not as readily be interpreted in favor of heredity as 
environment. 


125. Burt, C. Mental and Scholastic Tests. London, 1921, 
432 pp. (Sec. 6, pp. 175-184. ‘‘The relations between mental 
ability and educational attainments.’’) 


(1) For 689 London school children of ages 7 to 14 a correlation 
coefficient was computed to predict mental age on Burt’s revision of the 
Binet scale from a composite of educational age, chronological age, and 
mental age on a “reasoning test” devised by Burt. The equation was: 
B =.54S + .331 +.11A, where S was equal to educational age (or 
schooling), I was equal to standing on the reasoning test, and A was 
equal to chronological age. The author attributes to every factor a 
“causal” influence equal to its proportional weight in the regression 
equation. 

(2) In another section, the author cites 10 cases out of the total 
group showing an E.Q. 30 percent or more below the I.Q., but no child 
has an I.Q. so far below his E.Q. 


Comment: (1) See critique of this study by Holzinger and Free- 
man summarized farther on (No. 128). 


(2) The data above indicate that I.Q. places an upper limit upon 
the E.Q., and illustrate a method by which test data can be made to 
yield more than mere group treatment reveals. 


126. Ellis, R. S. ‘‘A comparison of the scores of college fresh- 
men and seniors on psychological tests.’’ School and Soc., 23: 
1926, 310-312. 


One hundred twenty-one Syracuse students who took the Miller 
Mental Ability Test when they were freshmen and three years later 
when they were seniors showed no improvement in average score. It 
was found, however, that gains of approximately .5 $.D. of the fresh- 
men distribution were made during three years on the following tests: 
(1) sentence completion, (2) artificial language, and (3) opposites. 

The author attributes these gains to “transfer of training” during 
the college course, rather than to any growth of fundamental capacities. 
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127. *Gordon, Hugh. Mental and Scholastic Tests Among Re- 
tarded Children. London Board of Education, Ed. Pamphlets, 
No. 44, 1923, 92 pp. 


This contribution consists of four separate studies. 


Study I. “Physically Defective Children” 

In two schools for the physically defective, 184 children whose rec- 
ords of school attendance could be traced had been in attendance only 
48 percent of the time since the age of five. (The corresponding per- 
cent for children in ordinary elementary schools is 88). The average 
I.Q. of the children (who were nearly all between 6 and 12 years old) 
was about 85. I.Q. and E.Q. (based upon the Ballard one-minute tests 
in speed of reading, adding, and subtracting) both correlated with per- 
cent of school attendance .31 = .07, which fact is taken as evidence that 
school attendance affects both I.Q. and E.Q. The negative rank-order 
correlations of I.Q. and E.Q. with C. A. were also taken as evidence 
for the influence of schooling. These were: 

LOwand CA: 

73 pupils in one school, — .37 = .07 
83 pupils in one school, — .42 = .06 

E.Q. and C. A. 

83 pupils in one school, — .29 = .07 

Comment: May not the low correlations of attendance with I.Q. 
and E.Q. signify that achievement is far more determined by intelli- 
gence than by attendance? This possibility is strengthened by the fact 
that an r of .78 between I.Q. and E.Q. is only reduced to ae when attend- 
ance is partialled out. 

Nothing in the data, of course, can show whether the small r actu- 
ally obtained between I.Q. and attendance is due to the influence of 
attendance on I.Q. or the influence of I.Q. on attendance. 

The possibility is not dealt with that the slight inverse relation- 
ship between I.Q. and C. A. and between E.Q. and C. A. might be due 
to some type of selection. 


Study II. “Canal-Boat Children” 


This study deals with 76 children of ages 5 to 13 (except two of 
4 years and one of 14) living on English canal boats. On the basis of 
a Ministry of Health report, the health and nutritional status of these 
children seems average. They differ from other children in type of 
home environment and in the amount of their schooling, which is only 
about 4 or 5 percent of normal. 

The Stanford Binet Test and Ballard Achievement Test were ad- 
ministered. The rank-order correlation between C. A. and I.Q. was 
— .76+.03. Taking 22 families in which from two to four sibs were 
tested, the inferiority of the older siblings was also very apparent. 
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Twenty-two oldest sibs averaged close to 60 I.Q.; 22 second sibs close 
to 72; 11 third sibs close to 77; and 5 youngest sibs about 8o. 

Comment: The C. A.’s of these five youngest varied from 4-7 to 
7-3. The median age of the oldest sibs was 12. A drop of a few 
points may be accounted for by a faulty scale, but evidence seems to 
point to an influence of either schooling or home environment. Selec- 
tion was not considered, but there seems no special reason to suspect 
it. It is significant that I.Q. and E.Q. correlate nearly as highly (.715) 
in this group as in the former group (.785), although attendance was 
only 4 to 5 percent. This certainly suggests that achievement is more 
dependent upon intelligence than upon attendance. 


Study III. “Gypsy Children in Surrey and Kent” 


Gypsy children of ages 5 to 13, inclusive, who had attended school 
an average of 35 percent of the time since the age of five, were tested 
by the Stanford Binet and Ballard achievement tests. 

I.Q. and E.Q. correlated .78 = .03 on 60 cases, and this r still re- 
mained at .78 when percentage of attendance was partialled out. 

I.Q. and percent attendance since age of five correlated .37 + .06 
on 82 cases, and E.Q. and attendance correlated .29 on 60 cases. 

C.A. correlated with I.Q.—.43+.06 on 82 cases, and with 
E.Q. — .37 on 60 cases. 

Inverse relationships, as in the case of the canal-boat children, 
were also apparent between I.Q. and C.A. of siblings, though not to 
so great an extent. The average of 7 third children was about 81, of 
21 second children about 78, and of 21 oldest children about 72. The 
third children averaged about 7 years C. A. and the oldest about 11. 

Comment: 'The same comment applies here as in the study on 
canal-boat children. 

The author appears to claim too much when he writes: “It appears 
that for all practical school purposes the three simple scholastic tests 
may be substituted for the mental tests in the case of such children as 
have been tested.” ‘This does not follow from a correlation no higher 
than .78 between I.Q. and E.Q. 


Study IV. “Backward Children” 


Girls of ages 7 to 14 in two London classes for backward children 
had an average I.Q. of 70 to 75 and had attended school an average of 
67 percent of the time. For 76 of them, attendance correlated — .o8 + .08 
with their I.Q.’s and .15 with their E.Q.’s. C.A. correlated — .47 with 
both the I.Q.’s and the E.Q.’s of 67 of them. Since their average attend- 
ance, despite their low I.Q.’s, was higher than that of the subjects of 
the previous sections, the conclusion is drawn that this group has low 
native intelligence. 

The chief correlations of the study are summarized below. 
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1.Q. and | E.Q. and | 1.Q. and | E.Q. and | E.Q. and 
C.A. C.A. Attend. Attend. HO» 


Physical defectives | — .422 — .288 Rolls .313 .785 and .707 
Canal boat........ c= AY Aa} 5 aio (Bun lea eer cl Phcrone Gas tae eoeh Ker RcL sen cer Le AALS) 
Gypsy ate — .566 — .374 .283 .289 . 784 
Backward.........| —.477 — .466 — .162 i2te . 784 


The author concludes: “It is quite evident that, although the mental 
tests used do undoubtedly test some kind of ability or abilities, such 
abilities are not developed without schooling or its equivalent, and as a 
consequence the tests do not evaluate them apart from schooling, ex- 
cept perhaps in the case of children under six or seven years of age.” 


Comment: ‘The following types of evidence were presented: 
(1) High r’s between I.Q. and E.Q. 

(2) Negative r’s between I.Q. and C.A. 

(3) Slight positive r’s between I.Q. and attendance. 


The first type is more likely to mean that E.Q. depends upon I.Q. 
than vice versa, especially since the correlations remain just about as 
high when percent of attendance is partialled out. 

The third type shows low r’s between I.Q. and attendance, but upon 
either a nature or a nurture hypothesis this would perhaps be expected, 
if the variability of attendance were not very great. (S.D. of attend- 
ance was mentioned only in the case of the “backward class” children, 
where it was I5 percent.) Thus, differences in attendance do not ac- 
count much for the differences among these children, but the very low 
percent may or may not have resulted in a large cumulative effect upon 
the I.Q., or the whole r may be due to selection. 

The second type of evidence suggests strongly that environment has 
had an effect, although it is impossible to assume, as the author seems 
to do, that the whole effect can be attributed to paucity of schooling as 
isolated from home environment. The author has not considered the 
possibility of selection in the upper ages, but the fact that in most of 
his groups there are about as many children in the upper ages as in 
the lower makes selection seem unlikely. The fact that in canal-boat 
families a drop in I.Q. occurs from about 90 to 60 in children of the 
same families from ages 6 to 12, and that a similar drop of from over 
80 to 70 occurs in gypsy families, reénforces Gordon’s theory, as do the 
lower negative r’s, in classes of physical defectives and backward chil- 
dren, between I.Q. and C.A. Thus, the very extreme environment of 
the canal-boat children would seem to have a cumulative effect of about 
5 points a year over at least six years. 


128. Holzinger, K. J., and Freeman, F. N. ‘‘The interpreta- 
tion of Burt’s regression equation.’’ Jour. Educ. Psych., 16: 1925, 
577-582. 
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The authors criticise Burt’s famous regression equation purporting 
‘o show that Binet test scores are conditioned to the extent of more 
than one-half by schooling. Burt was in error in assuming a priori that 
his reasoning test was a good measure of native intelligence, that a 
test giving a good prediction of school achievement was necessarily a 
poor measure of intelligence, and that the multiple regression technique 
provided a flaw-proof mathematical statement of the causal relation- 
ships of a group of variables to another variable. This last criticism 
is driven home by a regression equation computed by the authors upon 
Burt’s own data by which the chronological age of London school chil- 
dren is estimated by combining .15 Binet-Simon mental age; .51 school- 
ing; and .03 mental age on the reasoning test. The ironical suggestion 
is made that to keep children chronologically young, we should deprive 
them of schooling and arrest their mental development. 

See also the summaries in this section of Burt’s original con- 
tribution (No. 125), of Thomson’s answer (No. 130) to the present 
critique, and of the rejoinder by Holzinger and Freeman (No. 129). 


129. Holzinger, K. J., and Freeman, F. N. ‘‘Rejoinder on 
Burt’s regression equation.’’ Vouwr. Educ. Psych., 17: 1926, 384- 
386. : 


“Tf the capacity which is measured in the Burt score is intelligence, 
Thomson’s interpretation [of the Burt equation] would seem to imply 
not only that degree of attainment in school does not affect intelligence, 
but also that intelligence does not affect the degree of attainment in 
school. Intelligence in this view must be some sort of useless ornament, 
enabling us to work certain puzzles which the psychologists have de- 
vised, but of no conceivable value in a practical work.” 


130. Thomson, G. H. ‘‘The interpretation of Burt’s regres- 
sion equation.’’ Jour. Educ. Psych., 17: 1926, 301-808. 


The author attempts to defend Burt (and his own previous pub- 
lished acceptance of Burt’s conclusions) against the attack of Holzinger 
and Freeman. The sum total of his argument is that while cause and 
effect cannot be argued from a correlation per se, the most reasonable 
interpretation is that Binet score is affected by school work but that 
Burt’s test is not. 

(See summaries Nos. 125, 128, and 129.) 


Comment: ‘This defense leaves the general field of the effect of 
schooling upon mental age as unsettled as it was before Burt published 
his widely quoted equation. Data are needed which are more crucial 
than any merely giving correlations between M.A. and E. A. 


131. *Thorndike, E. L. ‘‘Mental discipline in high-school 
studies.’’ Jour. Educ. Psych., 15: 1924,'1-22, 83-98. 
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Using as subjects 8564 pupils in Grades IX, X, and XI, and allowing 
for practice effect and superior gains of the ablest on the Inst. of 
Educ. Research Tests of Selective and Relational Thinking, Thorndike 
reports the following results: 11 points for the year are gained by the 
average white pupil; 1.5 points for the year by the average colored 
pupil; 22.5 points, or 11.5 points above average, by virtue of taking three 
courses in science and one in mathematics; I9 points by virtue of tak- 
ing one course each in arithmetic, bookkeeping, stenography, and type- 
writing; 10.5 points for one course each in cooking, sewing, dramatic 
art, and physical education. 

By the best one percent in initial ability 20.5 points are gained; by 
the lowest one percent in initial ability, 1.5 points. 

Also, with respect to individual subjects, if a pupil takes one of 
these: algebra, geometry, trigonometry, Latin or French, instead of one 
of these: cooking, sewing or stenography, the best estimate of his in- 
crement of gain is a little under 2.5 points, far less than the difference 
in gain of average negroes and whites, or of superior and inferior whites. 

The author points out what he considers a lack of justice in re- 
fusing to permit pupils to offer subjects like stenography for college- 
entrance units. “The intellectual values of studies should be determined 
largely by the special information, habits, interests, and ideals which 
they demonstrably produce.” 


132. Wechsler, D. ‘‘On the influence of education on intelli- 
gence as measured by the Binet-Simon tests.’’? Jour. Educ. Psych., 
17: 1926, 248-257. 


The coefficients of variability in Burt’s and Terman’s original Binet 
material at ages 5 to 14 of mental age are studied. The conclusion is 
drawn that education makes children “more alike” after age Io or Il. 
Apparently, the possibilities of ambiguity due to selection and test stand- 
ardization are not considered. 


133. Willard, D. W. ‘‘Native and acquired mental ability as 
measured by the Terman Group Test of Mental Ability.’’ School 
and Soc., 16: 1922, 750-756. 


On the basis of this test, applied to 216 high-school pupils twice, 
(seven and one-third months apart), the author concludes that school 
training contributes to mental growth an amount equal to “a little over 
one-half the growth registered by the tests.” If such a rate were kept 
up, by the end of the high-school course the children could be “classed 
as a group of near geniuses.” 

Comment: The study entirely overlooks probable practice effect, 
which on group tests is usually nearly equal to the gains here attributed 
to school training. 
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134. *Woolley, Helen T. ‘‘The validity of standards of mental 
measurement in young childhood.’’ School and Soc., 21:1925, 
476-482. 


The subjects were 43 children in the Merrill-Palmer School between 
2-6 and 4-6 at first test, and 33 children, from comparable homes on 
the Merrill-Palmer waiting list, between 2-6 and 5-2 at first test. 

The Stanford Binet was administered to all children (presumably 
before entering the school, in the case of the Merrill-Palmer pupils). 
Retests were given from 7 to 14 months later (median interval about 
9 months for each group). Pupils attend the school from 9 A.M. until 
the middle of the afternoon, and are controlled by a definite educa- 
tional program. 

Sixty-three percent of the Merrill-Palmer group increased more 
than five I.Q. points, while 18.5 percent decreased more than five I.Q. 
points. Corresponding figures for the waiting list were 33 percent and 
36 percent. Thirty-three percent of the Merrill-Palmer pupils increased 
20 points or more; Io percent decreased 20 points or more. Correspond- 
ing figures for the waiting list were 6 percent and 16 percent. Actual 
I.Q. distributions are not given. The average I.Q. of I9 cases tested a 
third time was “a little lower” than on the second test, but 18 pupils 
did not show an average decrease after leaving the school. 

Comment: From Woolley’s figures the writer finds an average in- 
crease of 13.6 points for the Merrill-Palmer pupils and an average de- 
crease of 1.6 points for the waiting list. 


135. *Woolley, Helen T., and Ferris, Elizabeth. Diagnosis and 
Treatment of Young School Failures. Washington, 1923, 115 pp. 
(Bur. of Edue. Bull., No. 1.) 


This study deals with 16 children, C. A. 6-7 to 10-4, and I.Q. 75 
to 95 at the beginning of the experiment. Not one had covered the 
prescribed first-grade work. The group constituted the first “observa- 
tion class” of Cincinnati. 

Exceptionally good training was given these children (apparently 
for two years) in a special class. ‘They were tested from year to year 
by the Stanford Binet, performance tests, and educational tests. 

“In every instance, except those of the children who were ulti- 
mately shown to be feeble-minded, and one of those who was suffering 
from special disabilities, the intelligence quotient rose while the child 
was in the observation class and fell after he left it.” 

Ten cases showed increases from 3 to 18 points, with an average 
of 8. “The reéxaminations of these same children after a year to a 
year and a half, in a regular grade, show an equally uniform fail in 
_ intelligence quotient.” Decreases varied from 3 to 19, with an average 


of 8. 
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Comment: Environmental influence is clearly suggested by the 
data; but the cases are too few in number to establish such influence 
with certainty. 

136. Yerkes, R. M. (editor) Psychological Examining in the 
United States Army. National Academy of Science, Memoirs, 15: 
1921, 890 pp. (Ch. X. ‘‘Statistics on Schooling.’’) 

Median schooling is seen to correspond to intelligence medians of 


the following groups (except northern negroes were better schooled, 
but less intelligent than the foreign draft). 


Schooling Median Alpha 


OMCETS Heeieiey cctcteisie es ae Sesoletel ere atten ee ee 14.7 139.2 
Wihiterdratt (native-born). c.'w.eseiseas ene © 6.9 58.9 
White, drait (foreign-born) \. vervicaccoas at ses 4.7 46.7 
INCSTORAT ant MOLtiernmmermercce teastererete ciel: 4.9 38.6 
Negrordratt.ccouthemner os) a). oe voces sa 2.6 12.4 


For members of these five groups who had had equal schooling, 
differences still persisted on Alpha and Beta. For example, among 
those who had from five to eight years of schooling: 


Median Alpha Median Beta 


CUMICETS Ie ie sept calc eremAets orks anclan spits = serrate 107.0 Seas 
White draft, (native-born) .......5......% 51.1 53-3 
WVinitesdra lt, \(torcion=porn) «css serie sels Cae 47.2 52.7 
INGPrO.dratt Doren. <a settles cet Ye 41.5 
INegrovdratt, southerty 1. jess ses cote eee 16.3 35.1 


Six hundred sixty officers with schooling of eighth grade or less 
had a median Alpha score of 107.3, and 14,000 native-born white 
draftees had a median of 97.4, which shows “that education is certainly 
not the chief conditioning factor.” 

On the basis of r’s from a number of groups: “In an unselected 
group (i.e., including those men who would ordinarily be considered 
too illiterate to take Alpha) the correlation with the number of years 
of their schooling approximates .75; in an Alpha group (1.e., excluding 
illiterates) the correlation coefficient approximates .65.” ‘The range in 
schooling for these r’s is from none to eight years of college (white 
native-born); and from none to four years of college in a group of 
native-born and foreign. The r between Beta and schooling for the 
653 men is .67. 

The data are interpreted as meaning that the “theory that native 
intelligence is one of the most important conditioning factors in con- 
tinuance in school is certainly borne out by this accumulation of data.” 

Comment: The data seem to provide evidence of some influence 
of nature; but most of the data could be explained either by nature 
or by nurture. 
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5. Relation to Physique, Health, or Physical Environment 


Studies of this kind, of course, furnish significant evidence in 
favor of acquired changes in mental level only when mental changes 
are demonstrated in connection with physical changes or when 
mental differences are found associated with physical ones in com- 
parable experimental and control groups. A mere correlation be- 
tween disease or defect and intelligence cannot establish a causal 
relationship, since selection or common causes might account for 
such a correlation. However an absence of correlation between 
mental level and physical condition is significant, since such a re- 
sult dispenses with the necessity for considering causal relation- 
ships at all. 


137. *Dawson, 8. and Conn, J. C. M. ‘‘Effect of encephalitis 
lethargica on the intelligence of children.’’ Arch. Dis. Childhood, 
1: 1926, 257-368. 


The following results were based upon Binet tests (Burt’s revi- 
sion) given to 46 children in non-acute stages of encephalitis lethargica 
at the Royal Hospital in Glasgow. 

(1) The patients in question had an average I.Q. of 84.6, as con- 
trasted with 90.5 for 974 other children tested in the hospital. The 
difference is 5.9 + 1.3. 

(2) Twenty-three patients tested less than 12 months after the onset 
of the disease and 23 tested from 12 months to § years 4 months after 
onset had average I1.Q.’s of 89.5 and 79.8, respectively. The difference 
is'.9.7 = 2.3; 

(3) Thirty children retested after 7 to 36 months had an average 
1.Q. of 87.7 on the first test and 76.1 on the second test. The difference 
is 11.6 2.1. In 10 cases there was actual retrogression of M.A. and 
in 20 cases some gain, but in 12 of these the gain was negligible. In 
27 cases there was less gain than normal for children of the given I.Q.’s 
during the interval between tests, and in only five cases was the ratio 
of gain to expected gain as high as 75 percent. The correlation be- 
tween rise above expected gain and interval between the two tests is 
— .55+.08. Thus, mental development is seen to be arrested by the 
disease, although half of 8 patients who were given three tests showed 
some improvement on the third test. 

(4) A group of 23 subjects with available siblings averaged 85.6 
on their first test, and 20 of these averaged 76.0 on a retest. Their 27 
tested siblings averaged 96.0. ‘Thus, the patients differed from their 
siblings 10.4 = 2.2 on the first test and 19.6 on the retest. 

(5) Sixteen patients with the Parkinsonian syndrome and 30 with- 
out it averaged 84.9 and 84.5, respectively. 
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(6) Twenty-nine patients with emotional disturbances and 17 with- 
out them averaged 84.2 and 84.8, respectively. 


138. *Fernald, M. R., and Arlitt, A. H. ‘‘A psychological 
study of a group of crippled children of various types.’’ School 
and Soc., 21: 1925, 449-452. 


In the Cincinnati School for Crippled Children, 49 pupils tested by 
the Stanford Binet had an average I.Q. of 83.9. Eighty-nine siblings 
of these pupils had an average I.Q. of 89.2. The difference, approxi- 
mately five points, is possibly due to the restricted early activities of 
the crippled children, although no relationship between age of onset of 
malady and I.Q. was found among 194 cases, if spastic birth paralysis 
cases were not considered. Twenty-seven of these latter had an aver- 
age I.Q. of only 69. A correlation of .52 + .07 was found between the 
scores of 49 cripples and those of their normal siblings. 

Comment: ‘The correlation immediately above is ambiguous and 
spuriously high because the I.Q.’s of all normal sibs of each family were 
averaged to pair against the I.Q.’s of the crippled pupils. 


139. Heron, David. The Influence of Defective Physique and 
Unfavorable Home Environment on the Intelligence of School 
Children. Cambridge University Press, 1910. 


(As summarized by Heron in “Questions of the Day and of the 
Fray, No. II.’”’) 

“Particulars were given for 4286 boys and 4474 girls regarding 
mental capacity, age, standard [grade], height, weight, and condition of 
teeth, and in certain schools only regarding state of nutrition, condition 
of the clothing, state of cleanliness, power of hearing, and condition of 
the cervical glands, tonsils, and adenoids.” 

“The conclusion was reached that home environment . . . . could 
not be the chief determining cause of the differentiation of intelligence, 
nor was defective physique its source.” 

The possibility was also pointed out that the small relationships 
found might be an “indirect effect of race and stock.” 


140. *Laslett, H. R. Experiments on the Effects of the Loss 
of Sleep. (Ph.D. dissertation, Stanford University, 1926, 148 pp.) 


The subjects were four college men who lived on a “4o percent 
sleep ration” for five days and five college men who went entirely with- 
out sleep for 72 continuous hours. A code writing test, an addition test, 
and Part I of the Thorndike College Entrance Test were used to meas- 
ure changes in mental efficiency. 

The author concludes: “Long periods without sleep cause tempo- 
rary loss in the ability of the individual to perform tasks. The moré 
the task involves the higher mental processes, the greater is the loss.” 
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During the 40 percent “ration” period “the losses were not as great 
as those in the period during which other subjects obtained no sleep.” 
Comment: ‘The general conclusion seems sound, but the results 
are difficult to evaluate precisely because (1) the subjects are so few, 
and (2) the losses in efficiency are measured in terms of percent of 
original efficiency and hence are not comparable on all the tests. 


141. *Lowe, Gladys M. ‘‘Mental changes after removing ton- 
sils and adenoids.’’ Psych. Clinic, 15: 1923, 42-101. 


The subjects were 60 children with defective adenoids and tonsils, 
of whom 35 were operated on after the first test. 


Non-Operated Cases Operated Cases 
[Seah 0@ Pond aS. F4 ot Ta Meier rte 83.0 89.2 
BO Masvear late vnc see wnees ese ees 85.4 90.7 
No effects of diseased adenoids or tonsils upon the I.Q. were 
manifest. 


142. *Olesen, R., and Fernald, M. R. ‘‘Endemiec goiter and 
intelligence.’? Public Health Reports, 41: 1926, 971-986. 


Measurements of 3800 sixth-grade Cincinnati children, both white 
and colored, showed equal Otis Test levels for those with thyroid en- 
largement and for those with normal thyroid glands in the white group, 
and Otis Test levels six to eight points higher for colored children with 
thyroids enlarged than for colored children with thyroids normal. The 
discrepancy in the colored group may be partly due to small numbers 
and consequent errors of sampling. 

When intelligence was considered in the white group with respect 
to size of thyroid enlargement, the results were these: 


Boys Girls 
Percentile N Percentile N 
Slielita eee s cette enw sealers a 6 46 387 5I 484 
MOG eratel tactics sais ecisisievsiel is, ¢ 50 42 50 105 
IME g ately. 6 Bae Neo aGyob Rae 32 16 45 53 


This suggests a lowering in the case of marked enlargements, but 
the cases are too few in number to be certain. 

The conclusion is: “There is no evidence that either boys or girls 
with slight or moderate degrees of enlargement differ significantly in 
intelligence from those with normal thyroids.” While marked enlarge- 
ment probably had some effect, there is “no justification for an asser- 
tion of general relationship between thyroid enlargement and lower 
intelligence.” 


143. *Paulsen, Alice. The Influence of Treatment for Intes- 
tinal Toxemia on Mental and Motor Efficiency. New York, 1924, 
45 pp. (Archives of Psych., 11: No. 69.) 
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Thirty experimental subjects and 30 control subjects, nearly all of 
them young women in their twenties, were matched fairly closely for 
Terman Group Test scores. The experimental group were suffering 
from intestinal toxemia at the beginning of the experiment, but improved 
in this condition during the month of the experiment, during which 
remedial treatment was given. 

A battery of 17 tests was administered to both groups at the begin- 
ning and end of the experiment, and the net average gain of the experi- 
mental group was computed. This tended to be from about one-sixth 
to one-third of the S$. D. of the experimental group on the 12 mental 
tests, and from about a half to a whole S. D. on five motor tests. The 
average net gain was 8.8 times its P.E. on the whole battery. 

The mental tests used ranged in complexity from opposites and hard 
directions down to cancellation and color naming. 


144. *Pearson, K. ‘‘On the relationship of intelligence to size 
and shape of head.’’ Biometrika, 5: 1906-07, 105-146. 


A group of 1000 Cambridge men were rated on a four-point scale 
according to honors or pass degree. Over 5000 school children were 
rated on a six-point scale for intelligence. Boys and girls were classi- 
fied separately. Measurements of the school children were reduced to 
a standard age of 12 by aid of the average growth curves. Correlation 
ratios were computed between these ratings and certain physical meas- 
urements as follows: 


Cambridge Boys Gils 


Cor. Cor. Cor. 
Ratio | P.E. | Ratio | P.E. | Ratio | P.E. 


Intelligence and cephalic 


INCEXeaI  t ec ete ess ere — .061 .021 |—.041 .014 .067 .014 
Length of head............. pila .020 .139 .014 | .084] .014 
Breadth of head’) 2. so <1 .097 .021 .109 .014 sili .014 
AuTicularyherp hts ce Geet sists | es ace ei teers, eee .073 .014 .055 .014 


145. *Rogers, M. C. Adenoids and Diseased Tonsils; Their 
Effect on General Intelligence. New York, 1922, 70 pp. (Archives 
of Psych., No. 50.) 


Twenty-eight boys (ages 6 to 14) having diseased adenoids and 
tonsils were tested by the Stanford Binet Test the mornings upon which 
they had adenoid and tonsil operations, again six months later, and again 
12 months later. Their I.Q.’s averaged 2.25 points higher after six 
months. But 28 control boys, comparable to the above groups for age 
and diseased condition of adenoids and tonsils, gained 3.29 points on 
a second test after six months, though they had had no operation. -After 
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I2 months, 21 boys still available in the operated group had risen 
3 points and their 21 paired controls had risen 6 points. No significant 
correlation was found between age (presumably length of time defect 
had been present) and rise in I.Q. 

Two other groups of children in a large public school showed no 
difference in average I.Q. when separated into those who had diseased 
adenoids and tonsils and those who had normal adenoids and tonsils. 

Comment: The study seems to establish definitely that diseased 
adenoids and tonsil conditions have no unfavorable effect upon intelli- 
gence as such. The article would contain additional interest if the I.Q.’s 
could have been watched over a longer period of time. 


146. Smillie, W. G., and Spencer, Cassie R. ‘‘Mental retarda- 
tion in school children infested with hookworms.’’ Jour. Educ. 
Psych., 17: 1926, 314-321. 


The Otis Test and the National Intelligence Test were given to 
118 children in a hookworm area in an Alabama county. Only Otis 
I.Q.’s were used in the analysis, but these were “verified” by one or two 
N.I.T. scores. The Stoll ova count method was used at the Andalusia 
hookworm research laboratory to estimate the number of hookworms 
harbored in each child. Since all children lived on neighboring farms 
and had white American parents, the authors believe that selection prob- 
ably does not enter. 

TABLE OF RESULTS 


I.Q. Group N Estimated Number 
of Hookworms 
POS UL Fi geaccdore 26 ood ata aioe ola oe lot 4 30 
QS 104 street Nese stabs bust ce BGs arate le miren 18 47 
So amOAS mee iay wrsitheiated ok sPonars ouitel atutena iets 45 67.8 
OR USA Meare tno eck sales eo uals aj ae 36 172 
GG wel AIS rite hier tended sustain ote er da esl sanbpoueeteee 15 281 
Intensity cf In ection N Mean I.Q. P.E. 
INGZatiVe meer cee oe aes oe kak oe Seloahe ee 90.2 1.53 
Wersyg tite (125) eerste pets crcye ete eceysvsisy> oieici a. <\< 40 88.3 1.06 
Dei ge hat ( 261 OO ae cies creke stabete tie, shear aebedeqst sven 27 86.4 2 
Moderaten( O1[500) exer eee yee sails eaisierst> 23 84.1 1.07 
Heavy 601-2000): inv cise ters sisi sieste waters « LOgas 76.3 1.74 


It is stated that this study contributes data which previous hook- 
worm studies have not provided concerning the effects of various in- 
tensities of hookworm infection. 

Comment: The authors’ belief that selection does not affect results 
is not demonstrated. It still remains possible that children of incom- 
petent parents were more likely to be exposed to hookworm. 
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147. Terman, L. M. The Intelligence of School Children. 
Cambridge, 1919, 313 pp. 


For 27 children retested after an operation for tonsils or adenoids, 
the average gain in I.Q. was two points. There were 10 losses and 
17 gains, no gain larger than 14 points, and only two larger than Io 
points. The results “suggest that adenoids and diseased tonsils may 
give a child an exaggerated appearance of dullness.” 


148. *Thorndike, EB. L., McCall, W. A., Chapman, J. C. Ven- 
tiation in Relation to Mental Work. N. Y., 1916, 83 pp. (Teach- 
ers College, Columbia University, Contrib. to Educ., No. 78.) 


Conditions of ventilation had no effect upon the mental efficiency 
of over 100 college students, measured on the following tests: naming 
100 colors; cancelling 2’s for 60 seconds; cancelling 3's for 60 seconds; 
50 hard opposites; addition of Io digits for 10 minutes; mental mul- 
tiplication of 3-place numbers by 3-place numbers for 20 minutcs; 
20 minutes of typewriting, and other miscellaneous tasks. 

The conditions ranged from 68° F., 50 percent relative humidity, 
and 45 cu. ft. of air per person per minute to 86° F., 80 percent relative 
humidity, recirculated air and stagnant. Favorable conditions and poor 
conditions were alternated daily in some experiments and conditions 
were held constant for five consecutive days in others. 


149. *Thorndike, E. L., Ruger, G. J.. McCall, W. A. ‘‘The 
effects of outside air and recirculated air upon the intellectual 
achievement and improvement of school pupils.’’ School and Soc., 
3: 1916, 679-684. 

(See Thorndike and Ruger, No. 150.) 

150. Thorndike, E. L., and Ruger, G. J. ‘‘The effects of out- 
side air and recirculated air upon the intellectual achievement and 
improvement of school pupils (a second experiment).’’ School 
and Soc., 4: 1916, 260-264. 


The results of article No. 149 are also summarized here. 

On two successive years experiments were made upon two classes 
of about 40 pupils each with respect to the relative influence of outside 
air and recirculated air upon mental and school efficiency. In the groups 
supplied with recirculated air occasional outside air was introduced, but 
steam consumption in classroom heat was diminished 50 percent. Dur- 
ing one-half a school year the groups supplied with recirculated air 
showed a very slight tendency toward larger gains than the “fresh air 
groups” on tests of the intelligence and achievement type, and on 
school marks. 

Comment: The slight tendency toward larger gains in the “recir- 
culated air groups” is possibly accidental or due to errors of sampling. 
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It is significant, in any event, that these groups show no diminution 
in efficiency. 

151. *Yerkes, R. M. (Editor) Psychological Examining in the 
Umted States Army. National Academy of Sciences Memoirs, 15: 
1921, 890 pp. (Ch. XIII. ‘‘Influence of Certain Physical Condi- 
tions on the Intelligence Score.’’) 

(1) At Camp Custer 178 men who had had the complete typhoid 
inoculation about 24 hours before were compared with the entire incre- 
ment of the same month, 7,167 men. The 178 had a mean Alpha score 
of 67.7 and S. D. of 33.5. The 7,167 had a mean Alpha score of 67.1, 
and S. D. of 35.6. This comparison “is deemed more satisfactory” than 
one with 239 men of the same unit (not inoculated) who had a mean 
score 74.2 and S. D. of 35.3 because the latter “group contained a good 
many superior men who were being held over....because of possible fit- 
ness for officers’ training camps.” Seventy men “who themselves judged 
that their showing on the scale had been affected by their recent inocula- 
tion” had a mean score of 67.4 and S. D. of 33.5; 108 who “felt this to 
have had no effect” had a mean of 67.9 and S. D. 38.8. 

(2) Data are reported showing recruits infected with hookworm 
to be distinctly inferior, and venereals somewhat inferior, but it is 
pointed out that selection is a possible explanation. In fact, 317 white 
venereals, who became infected after the test, were distinctly below 
unselected men at the same camp (Lee). 


6. Relation to Age or Maturity 


Selection complicates most studies under this heading (when 
scores of different age groups are compared). A few studies, how- 
ever, give retests of the same children at different ages, and pro- 
vide crucial evidence on the duration of mental growth. In most 
studies thus far made, however, we are not justified in assuming 
that the actual form of mental growth is shown in more than a 
very general way, since the scaling of most tests is not such as to 
enable us to say that one unit at a given level on the scale repre- 
sents the same increment of ability as does one unit at another level. 

A point of special interest concerns the average adult mental 
level as compared with the mental level of young people in their 
teens and still at school. Growth curves for young people usually 
extend to a level higher than that representing the average adult 
mental level, but whether this fact is due to actual decline of adult 
capacities or merely to the greater remoteness of school training in 
the case of adults is difficult to ascertain. 
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152. Ballard, P. B. ‘‘The limit of the growth of intelligence.’’ 
Brit. Jour. Psych., 12: 1921, 125-141. 


Lumping results from an absurdity test given to elementary-school, 
secondary-school, and continuation-school pupils and evening-school 
adults, it is found that growth practically ceases at age I5. 

The author admits that he can not be sure of having unselected 
groups, but believes that he has demonstrated no growth on this test 
after 16 because in secondary schools, where selection might be expected 
to raise the scores of older pupils, the maximum level was reached 
at age 15. 


153. Brooks, F. D. Changes in Mental Traits with Age. New 


York, 1921. (Teachers College, Columbia University, Contributions 
to Education, No. 116.) 


When 171 children in Grades IV to IX and between ages 9 to 15 
were tested twice, a year apart, on a battery of 16 intelligence and 
achievement tests, there was an increase at all the ages. There was a 
slight tendency for the increase to be less in magnitude at 14 to 15 than 
at 9 to 10, but the significance of this is clouded by possible selective 
factors. 


154. *Cobb, Margaret V. ‘‘The limits set to educational 
achievement by limited intelligence.’’ Jour. Educ. Psych., 13: 
1922, 449-464. 


The increase in Alpha scores from freshmen to senior standing in 
the high school which are due respectively to (1) mental growth to- 
gether “with the temporary effects of instruction” and (2) elimination 
and the “more permanent effects of instruction” are roughly estimated. 

The superiority of senior scores over freshman scores in high 
schools of eight states tended to be about 30 points. But the superiority 
of army recruits who had been seniors in high school over those who 
had been only freshmen in the high school was only 17 points. Thus, 
(1), above, seems to account for about half the difference between 
freshmen and seniors. But (1) seems to account for no significant 
superiority of seniors over juniors by a similar comparison. This superi- 
ority is much less than in other grade comparisons. 


155. Hart, H. ‘‘The slowing up of growth in mental test 
ability.’’ School and Soc., 20: 1924, 573-574. 


A total of 579 Iowa children born in 1908 attending intermediate, 
high, and part-time schools in Davenport, lowa, was studied. As the 
tests were given in December, 1923, all the children were between 
15 and 16. ‘S 

The Army Alpha test gave an r with C. A. of .10 = .03, which cor- 
responded to a gain of 5.16 (or almost surely between o and Io points, 
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allowing for 3 P.E. variation). If the gain were only Io points a year 
since year 9, the average Alpha score would be 30 at age 9, whereas 
it is only 4 by Alpha norms. These facts are taken as a demonstration 
that growth is much slowed up at 15. 

Comment: The possible factor of unequal units at lower and higher 
levels obscures the results somewhat. 


156. Hopkins, L. T. The Intelligence of Continuation School 
Children. Cambridge, 1924, 132 pp. 


All the 14- and 15-year-old children in five Massachusetts communi- 
ties were tested on the Dearborn Intelligence Scale. These included 
both continuation-school and regular-school pupils. In the regular- 
school group 60 percent of 14-year-olds and 69 percent of 15-year-olds 
overlapped the medians for the same ages of the regular and continua- 
tion pupils combined. In the continuation-school group 18 percent of 
14-year-olds and 28 percent of 15-year-olds overlapped these medians. 
The difference between ages 14 and 15 for the two groups combined 
was about 4.5 points. 

Comment: Since the variability of the group is not reported, the 
significance of the age gain is not entirely clear. 


157. Symonds, P. M. ‘‘Second approximation to the curve of 
the distribution of intelligence of the population of the United 
States.’’ Jour. Educ. Psych., 14: 1923, 65-82. 


By taking the medians and first and third quartiles Army Alpha 
scores for occupational groups (as given by Fryer) and weighting these 
according to the frequencies of the occupations listed in Table 14, p. 53, 
vol. 4 of the 1910 U. S. census, a mean Alpha score of 50.66 is obtained, 
a median of 48.41, and an S.D. of 43.00. For the army, a weighted 
composite median of the white draft (native and foreign-born) and the 
colored draft was 51. 

The author thinks the Stanford Binet was standardized on selected 
groups, because occupations testing at median on Alpha test have an 
1.Q. of only 80 or 82 on the Stanford Binet, which would correspond to 
an M.A: of 13.2. 


158. Teagarden, Florence M. A Study of the Upper Limits of 
the Development of Intelligence. New York, 1924. (Teachers Col- 
lege, Columbia University, Contributions to Education, No. 156.) 


The subjects were 408 children of 12 to 20 at “Mooseheart,” a 
cottage-plan institution for dependent children of Moose lodge deceased 
members. Feeble-minded children are not accepted there. ‘The aver- 
age I.Q. was 93.5, but this may be a little low, since 16 was used as the 
‘critical divisor’ for the older ones. The author believes that selection 
from age to age is not operative, except in those over 18, because 
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“school mortality” does not exist in the ordinary sense. Children are 
not admitted after 14, but are kept and educated through their teens— 
by academic training if they can take it, by vocational training otherwise. 
The Stanford Binet, Army Alpha, Pressey Senior Classification 
Test A, and Stenquist Mechanical Aptitude Test I were administered. 
Three ways of scaling tests to make them comparable with them- 
selves and each other at all ranges were tried, and gave almost iden- 
tical results; so one only was used which assumed a normal distribu- 
tion for the total population of children. The author points out objec- 
tions to this assumption, but thinks it will answer for practical purposes. 
With slight negative accelerations, the tests all show increases be- 
yond 16; Binet at least to 18, Alpha to 16 or 16.5, Pressey to 17 or 
17.5, Stenquist to 18, and Alpha to 16.5 or 17.5. 
Comment: The possibility of selection does not seem entirely ruled 
out, although the results might perhaps be accepted tentatively until 
more crucial ones can be obtained in the future. 


159. *Thorndike, EK. L. ‘‘On the improvement in intelligence 
scores from fourteen to eighteen.’’ Jour. Educ. Psych., 14: 1923, 
513-516. 


From 2638 to 3136 pupils in each of Grades IX, X, XI at first 
testing, and in Grades X, XI, and XII at second testing one year later 
served as subjects. 

Form A of composite Institute of Educational Research tests was 
given to about half and Form B to the other half of the pupils in 1922; 
then the forms were reversed in 1923. The two forms were equated 
for relative difficulty, and a practice effect averaging 11.9 was found 
by taking scores of pupils (not counted in the subjects above) who took 
the test for the first time when the others were taking it the second 
time, and by comparing the scores. 

The median gains for the pupils in Grades IX, X, XI in 1922 (after 
practice effect was allowed for) were 10.5, 11.7, and 11.5. The average 
gain was II.I. 

This is one-third of the mean square deviation of all the pupils on 
their first trial, “or about one-half of what their mean square deviation 
would be if they were perfectly measured.” The estimate is made (how 
not stated) that the variability of all the pupils is at least half that of 
unselected 14-year-olds, who have an estimated mean square deviation 
of 23 months Stanford Binet M.A. Thus, the gains “may be set as 
equivalent to at least 10 months of mental age around 14.” 

Discussing the apparently constant rate of gain from Grades IX 
to XII, the author says “any decrease in gain with age, if such there 
be, is offset by the selection of those more capable of gain.” 


160. *Thorndike, KE. L. ‘‘On the improvement in intelligence 
scores from thirteen to nineteen.’’ Jour. Educ. Psych., 17: 1926, 
73-76. 
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Several thousand high-school pupils from 13 to 18 in two cities 
were tested on J.E.R. Test of Selective and Relational Thinking. Se- 
lection was allowed for by finding an upper and lower limit of “superior- 
ity” in gains in one year (Form A and Form B) of younger pupils 
over older ones. Practice effect was also allowed for. 

The gains of pupils from age 13 to 14 at various grade-placements 
is not significantly greater than those of pupils from age 14 to 15, etc., 
up to 18 to 19. 


161. Thurstone, L. L. ‘‘A method of scaling psychological and 
educational tests.”’ Jour. Educ. Psych., 16: 1925, 433-451. 


Two fundamental formulas are derived for establishing on an ab- 
solute scale the mean ability of an age-group when that of an adjacent 
lower age-group is known. Using these formulas upon the scores of 
3000 London children from ages 3 to 14 tested by Burt on his Binet 
Revision, it is found that mental growth shows little signs of slowing 
up at age 14. The author concludes from the form of curves that it 
might “reach a limit in the early twenties or perhaps even at the age 
of 20, but it can hardly be extended to reach a limit much sooner than 
that.” 

Comment: ‘The conclusion may possibly be distorted by selective 
factors of school mortality in Burt’s data. 


162. Whipple, G. M. ‘‘Endowment, maturity, and training 
as factors in intelligence scores.’’ Sci. Monthly, 18: 1924, 496-507. 


“In one Michigan city it was possible to test every 14-year-old and 
every I5 year-old child. The average scores for 14 and I5 years were 
139 and 142 respectively; 1.e., the performances in these two ages were 
practically indistinguishable.” 

Increments from 8 to 9, etc. up to 14 to I5 are: 15, 15, 17, 16, I5, 
9, 3. (The National Intelligence Test was used.) Numbers or ‘school 
status of pupils were not stated. 


163. Woolley, Helen T., and Fischer, Charlotte R. Mental and 
Physical Measurements of Working Children. New York, 1914, 
247 pp. (Psych. Monographs, 18: No. 77.) 


The subjects were native-born white children applying for work 
certificates at age 14, who had not been out of school longer than three 
months. 

Mental tests selected from Whipple’s Manual were administered 
twice, apparently to several hundred children a year apart (within two 
months plus or minus). Physical tests were also given (which are 
not commented upon here). Cancelling a’s test; opposites; rote mem- 
ory for 7, 8, and 9 digits; sentence completion (13 sentences); card 
sorting; form-board; and puzzle box (content was changed for second 
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test) were the mental tests used. The authors conclude that “the 
greater part [of the gain] can be attributed to the added year of 
maturity.” 

Comment: Results are presented in tabular form, but the stand- 
ardization of the tests and data on dispersion are not adequate for pre- 


cise interpretation. It is not possible, either, to separate practice effect 
from actual gain. 


164. *Woolley, Helen T. An Experimental Study of Children. 
New York, 1926, 762 pp. d 


The treatment of data gathered ten years previously and reported 
in the 1914 and 1915 references is extended. Consecutive tests were 
given both to the working children and school children from ages 14 
to 18. The study started out with 760 school children and ended with 
152; and with 753 working children and ended with 511. A fresh influx 
of school children was added at age 16. While possible practice effect 
on test scores had not been adequately allowed for, it is significant that 
in the total group of school and working children combined: “The 
easier the mental task, the earlier an approximately adult status is 
reached. ‘Thus, in memory an adult status is attained for the seven- 
place series at 15, for the eight-place at 16, and for the nine-place at 
17 years.” “Substitution” shows negligible gains beyond 16 years. 
Growth in these functions continues as long for the working children 
as for the school children. 


Comment: All children tested at any age were used in the com- 
parisons of age gains. It might have been more valuable to make com- 
parisons using only the identical children throughout. The possibility 
that practice effect may account for a small or even large part of the 
gains cannot be overlooked. But the fact that gains occur in working 
children as long as in school children would indicate that schooling is 
not responsible for the gains. 


165. Woolley, Helen T. ‘‘A new scale of mental and physical 
measurements for adolescents, and some of its uses.’’ Jour. of 
Educ. Psych., 6: 1915, 521-550. 


Seven hundred fifty 14-year-old children were tested, and 680 of 
the same children were retested at age 15 after they had dropped out 
of school and been at work a year. 

A series of physical and mental tests was given twice with a year’s 
interval. 

The opposites test was also given, but a harder list at age 15 than 
at 14; so it is not comparable. 


Comment: ‘The study is significant in showing that, even with 
children who have stopped school, there is some gain from 14 to I5. 
Practice effect is not guarded against, however. 
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Gain 14th-15th Year 


Test ——__——__—_—_—_} Q of Boys 
Boys Girls 

Cancellation index.c twice: fsiaees owes eek +37 +44 105 
Cancellation (percent accuracy)......... aye il 14 20 
Memory, 7-placenGgigits....0+.s.s) .esnere: 3 4 16 
Memory, S-place digitsa.cd.s ac. voces hen 6 4 26 
Substitution (4th page), (other pages seem 

TORCHECk am eRe en nee lech te de Gy 6 9 40 
Sentence completion (number correct)... ... ileal 0.7 2.8 


166. Yerkes, R. M. (Editor) Psychological Examining in the 
United States Army. National Academy of Science Memoirs, 15: 
3, 1921, 890 pp. 


Chapter 14. “Relation of Intelligence to Age.” 

A group of 15,385 white officers from the “principal sample” had 
a median Alpha score of 150 for 24 cases of age 20 or less. The 
scores remained at about 146 from age 21 through 26, and then dropped, 
slowly at first, more rapidly later, to 120 in the group of 51 to 60. 
Fight cases of age 60 and over had a median of 140. The author points 
out that the unknown selective influences make the results ambiguous. 
Data from other scattered officer groups corroborate the form of curve, 
and data from over 1000 enlisted men, half each on Alpha and Exami- 
nation a, show no decline from 20 to 30, thus confirming the results 
on officers. But in a group of 600 negro recruits the medians of men 
21-23 and 29-31 rose from 34.8 to 40.5 (Alpha). 

The following r’s were found between age and Examination a: 


5000 Med oficersi( ef, below) c.'s teess'es ccc cine ers —.192 
ZOOL CAs OMICOPS1 ( 21-30) a) sare ehciesicesivie diss anise —,.063 (P.E. .013) 
Bovon ed Momicers<( 30-00) ccs aie volss ie wise baie ss are —.192 (P.E. .o11) 
WAQH INA CO MOMICCES Ee crataie cs tatae site als ol solos saa’ ies —.OII 

Bow NLAerysOUICElS 6 seis ss 02a s1s.2 a tatters «ieee « —.120 


Chapter 11. “Intelligence of the Draft.” 


The principal sample of scores of the white draft, when transmuted 
from Alpha and Beta examinations into terms of mental age, have a 
median of 13.08 years. 


7. Effect of Volitional and Emotional Traits 
This is a most important topic, upon which little experimental 
literature can be found. 
167. Bronner, Augusta F. ‘‘Attitude as it affects perform- 
ance of tests.’’ Psych. Rev., 23: 1916, 303-331. 


Using Binet’s 1911 revision and tests from Healy and Fernald’s 
Tests for Practical Mental Classification, the author reports case studies 
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of children tested at the Juvenile Psychopathic Institute who improved 
two or three years or more in M.A. on a second testing a few days 
or weeks later. Clinical descriptions make it evident that blocking of 
performance on Binet as well as on Healy-Fernald tests was often 
due to the following factors: 

(1) Deliberate deception 

(2) Recalcitrancy 

(3) Sportiveness 

(4) Emotional disturbances, including general depression, anger 
or resentfulness, fear, sheepishness, shyness, embarrassment due to on- 
lookers, homesickness, mental conflict 

(5) General excitement, including exceptional desire to do well 

(6) Lack of confidence 

(7) A combination of causes 

Comment: 'The small numbers and subjective character of the 
clinical observations prevent the study from yielding precise conclu- 
sions; but the results seem important and suggestive, nevertheless. 


168. *Hurlock, Elizabeth B. ‘‘The effect of incentives upon 
the constancy of the I1.Q.’’ Ped. Seminary, 32: 1925, 422-434. 


One hundred forty-one eighth-grade and 132 fifth-grade children 
in two New York public schools were divided evenly into a control 
group, “praised” group, and “reproved” group on the basis of the Na- 
tional Intelligence Test, so that the mean and S.D. on the test were 
almost identical, as were the C.A., I.Q., and proportion of each sex in 
each group. 

The National Intelligence Test, Scale B, Forms 1 and 2 were ad- 
ministered a week apart. The three groups took it together the first 
time, but took it in single groups the second time. ‘The second time 
the control were merely given standard directions; but the “praised” 
were told they had been singled out from the entire class; the “reproved” 
were told they were a disgrace to the class. 


Results: 
Aver. I.Q. S.D. Increase Increase in 
First Test First Test | Second Test | Units of P.E. 
of Diff. 
GOnErOl fei weisioe aieiare ale eielevoralerelsioeleioieis 100.23 18.11 69 -40 
IPT Aine dare teveistere tere eicistereleisievevers eieleicic/e\cese 100.30 18.02 6.74 3.62 
Reproved ts. miviserioaleiae svelte icieiare resto 100.43 18.04 | 6.88 3.72 


Advantages are found, therefore, for both the praised and the re- 
proved groups. 


169. Hurlock, Elizabeth B. The Value of Praise and Reproof ~ 
as Incentives for Children. New York, 1924, 78 pp. (Archives of 
Psych., No. 71.) 
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(For abstract, see summary No. 231.) 


170. Jewett, S. P., and Blanchard, Phyllis. ‘‘Influence of 
affective disturbances on responses to the Stanford Binet test.’’ 
Mental Hygiene, 6: 1922, 39-56. 

The authors conclude that I.Q.’s on the Stanford Binet are modi- 
fied by psychoses, drugs, or emotional disturbances. Fourteen case 
studies are given in which very radical shifts of measured I.Q. can 
reasonably be attributed to disturbances of personality. 

171. Sullivan, E. T. Mood in Relation to Performance. New 
York, 1922, 69 pp. (Archives of Psych., No. 53.) 

No relationship was found among a group of adults between an 
introspective measure of cheerfulness and their performance on a bat- 
tery of short mental tests. The conclusion was: “Extremes of mood 
act as distractions, the absence of any demonstrable effect on perform- 
ance being due to the tendency of the individual to overcome resistance 
by increasing the output of energy.” 


8. Constancy of Intelligence 


Studies upon stability of intelligence over short or long periods 
may show that whatever is measured is a permanent attribute, but 
can not show what determines it unless retests are analyzed in 
connection with known changes in environment. Nevertheless, it 
is important merely to know whether or not intelligence is stable, 
since a tendency toward unstable intelligence, if found, would be 
favorable to an environmental theory, even though stable intelli- 
gence would not be crucial evidence for a theory of heredity. 


172. Baldwin, B. T., and Stecker, Lorle I. The Mental Growth 
Curve of Normal and Superior Children Studied by Means of Con- 
secutive Intelligence Examinations. (University of Iowa Studies 
in Child Welfare, 1922, No. 1, 61 pp.) 

For a group of 36 children who were given five successive Stanford 
Binet tests at average intervals of 13 months, 15 months, 10 months, 
and 5 months, the correlations between tests ranged between .74 (first 
and third test), and .92 (fourth and fifth test). The correlation for 
the first and fifth test was .82 + .04. Correlations for somewhat smaller 
groups tested on two to four occasions ranged from .72 to .93. 

173. Baldwin, B. T. ‘‘ Additional data from consecutive Stan- 
ford Binet Tests.’’ Jour. Educ. Psych., 13: 1922, 556-560; also in 
Jour. Educ. Research, 8: 1923, 375. 


(See Dickson’s tabled summary, No. 177.) 
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174. Broom, Eustace. ‘‘Constaney of the I.Q.’’ School and 
Soc., 25: 1927, 295-296. 


The Terman Group Test was given twice to 50 high-school pupils. 
The intervals between tests varied from 6 months to 35 months. On 
the first test the mean I.Q. was 110 and S.D. 12. On the second test 
the mean I.Q. was 110 and §.D. 10. Age 16 apparently is the basic 


C.A. used for pupils over 16. The r between test and retest was 
.86 = .03. 


175. Cox, Catharine M. Genetic Studies of Genius, Vol. I. 
The Early Mental Traits of Three Hundred Geniuses. Stanford 
University, 1926, 842 pp. 


This represents one of the most significant studies of I.Q. con- 
stancy ever made. In the author’s words: “The purpose of this study 
is to characterize a group of young geniuses with respect to certain 
mental traits. The subjects described are 301 of the most eminent men 
and women of history. The data discussed are the historical records 
of their heredity, their childhood, and their youth.” 

From the first 510 on Cattell’s list of the 1000 most eminent indi- 
viduals of history, 282 were selected who were born between 1450 and 
1850. Hereditary aristocracy and nobility were excluded. These 282 
formed the main group of the study from which the chief conclusions 
were drawn. 

Upon the evidence of data collated from 3000 biographical sources, 
Professor Terman, Professor Maud A. Merrill, and the author esti- 
mated the I.Q.’s of the subjects, (1) in childhood up to age 17, and 
(2) in the period from 17 to 25. “The ratings were made on the basis 
of the standards and norms for intelligent behavior established by 
mental tests.” 


Aver. I.Q. Same, Same, 
Number up to Corrected | Aver. I.Q. | Cocrected 
17 Yrs. for 17-25 for 
Regression Regression 
Artists and Writers (poets, 13 122 140 135 160 

novelists, dramatists)....... 52 141 160 149 165 
Writers (essayists, historians, 

Oritics, SCHOIATS). «64 ../6 «011 43 139 160 148 170 
IVETISIOIAIIS fy cre crete plein sictelsioyerenre 11 130 145 140 160 
Boldiersty e\sto cts susseseleraarsts leer 27 115 125 125 140 
SGIENCISEB ss ).10%5!e.01ecel cuolslevensyeretace 39 135 155 152 175 
PTOROPHERB ies. okeie'« ceaetekelenalere 22 147 170 156 180 
Revolutionary statesmen...... 9 140 160 144 165 
Religious leaders...........-. 23 132 150 145 170 
Statesmen achien siscwsiee tocrents 43 135 155 142 165_' 
Hs NeG UG Bimias o ersfelsin. otters 10 175 185 185 195 
THOW CSU. QAR acy, tie acesiels it sralets/ de 10 113 140 123 150 


Total WGroups els stv era hrolereistele 282 135 155 145 165 ; 
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Ten sub-groups dividing the subjects upon the basis of field of 
activity were formed. The average estimated I.Q. of members of these 
sub-groups (by all three raters) for both (1) the early period and 
(2) the later period of youth, are tabled below. Average I.Q.’s rep- 
resenting the probable level of the subjects after regression of ratings 
toward the mean of the generality due to the unreliability of the data 
had been allowed for are also tabled. 

Data are also given upon character traits of the subjects in child- 
hood, but these are not summarized here. The most significant con- 
clusion of the author is that “the extraordinary genius who achieves 
the highest eminence is also the gifted individual whom intelligence 
tests may discover in childhood. ‘The converse of this proposition is 
yet to be proved.” 

Comment: ‘The validity of the correction applied for regression 
toward the mean of ratings based upon unreliable data is open to ques- 
tion, since it cannot be told, a priori, whether certain data were scanty 
(z.e., “unreliable’) because there actually were few indications of 
precocity in the boyhood of the men in question or because the data 
on their precocity simply failed to be recorded in their biographies. 
Nevertheless, the figures represented by the uncorrected estimated I.Q.’s 
alone are salient enough to give emphatic support to the author’s 
conclusion. 


176. Cuneo, Irene, and Terman, L. M. ‘‘Stanford Binet tests 
of 112 kindergarten children and 77 repeated tests.’? Ped. Semi- 
nary, 25: 1918, 414-428. 

(See Dickson’s tabled summary, No. 177.) 


177. Dickson, V. Mental Tests and the Classroom Teacher. 
New York, 1923, 231 pp. 


In pp. 65 ff. dealing with I.Q. constancy, original data are presented 
and other studies are also summarized. 

Original Data. ‘These data embrace 288 retests (or 576 individual 
tests) by 84 different examiners—chiefly by teachers who had been, 
or were being, trained in mental testing. The subjects ranged from 
4 to 16 years, with a median I.Q. 86, and an I.Q. range from 33 to 140. 
The time between the tests varied from less than one month to three 
years, with a median interval of 13 months. Between the two tests 
the r was .90, the median difference 5.1 points, and the median increase 


.5 point. 
The amount of change was distributed thus: 
No. Points Percent 
Difference of Tests 
o0—I10 88.2 
TL -——10 9.7 


over 20 2.1 
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The probable causes of changes greater than 10 points were: 


Foreign language dificulty..: + o.% «0 enc 68 17 cases 
One or both tests incomplete..........s0s000¢ Oar 
Psychopathtercases’, cuivasn ts cactisaete ss cota 003 males 
Notapparenticauses. nics sa wie oe hee oes 3 


SumMMARIzED Data From OTHER SOURCES 


Experimenter r Cases 
Ben quisty cr tiv tetc cc eee olercrowtesiac alas sietere dolerite ie etere wt 274 
Ruse. and Comloton. ace sates satcicioies sole ciels-ere eee ee tevorsiels .84 137 
POLINA 7 w/eveletncr dis alelere S05 inns crevevs (3) cree sials ters oosie a's avels/aeetsyalsls -93 435 
BS ell iwiltver ere yavete(alelavotoe sictasemvareiPa cto caickersiere/ate archwires oes .74 to .94 (Various groups) 
Gordon Mane leeiere tore ccvelciele.crcisieiers orate orsieroloceWelas iota ewien .84 44 
Bobertag sstarcaiereclelciccie' slo seve s v.sivielsiwrcistererseee ore sheierrsiseeats 95 
FROSCNOW a .xsperarofott oie oie wie: sielers sveless cis wross.giclstae a ensioneiteiore st 82 ets 
Cuneo,and Terman sein caeleerence cio ae onc lon scion eietore pieteieiears 85 31 (2-year interval) 
@uneo,and Terman nary cciesaicie clase a sisiee eslensa veraies ans -94 21 (6-month interval) 
Wuneoiand Merman ssh,c-sisc orerere oriole eiatciactete efiiste min cleien .95 25 (2-day interval) 
GATTISOM aa rolerere terete aoe erutalcherelwlaeheteiererat a renee re olka reketehs .88 298 (1-year interval) 
GAPrisOM va eteioie ters leieere alec ciate ite steht ake clare serene tole nies 91 127 (2-year interval) 


Garrisontvers.. foe do acne eaten: pinindines Mosler Leiter ee .83 42 (3-vear interval) 


178. Dvorak, August. ‘‘The relation of I.Q. to the prognosis 
of special-class pupils.’? School and Soc., 19: 1924, 736-744. 


A theoretical discussion explaining why low I.Q.’s shculd become 
still lower and high I.Q.’s should become still higher if the increments 
of yearly growth become consistently smaller with advancing C.A. If 
the rate of growth of bright and dull children with respect to the 
normal remains a constant, dull children will be slowing up in growth 
at a place on the M.A. scale where small increases of M.A. mean 
larger increases in absolute ability, and bright children will be tested 
at a place on the scale where small increases in absolute ability mean 
larger increases in M.A. Hypothetical growth curves are worked out 
upon this assumption and shown to be parallel to curves based on data 
from Kuhlmann’s retests over Io years of 639 feeble-minded children 
(Jour. Appl. Psych., Sept., 1921). 

The results of Woodrow (Brightness and Dullness in Children) 
and Baldwin and Stecher (Mental Growth Curve of Normal and Supe- 
rior Children, 1922) bear out the author’s contention. 


179. Freeman, Frank N. Mental Tests. Cambridge, 1926, 
503 pp. 


On his p. 345, data from retests on the Stanford Binet are sum- 
marized as shown here on p. 323. 
180. Garrison, C. ‘‘ Additional retests by means of the Stan- 


ford Revision of the Binet-Simon Tests.’”’ Jour. Educ. Psych., 18: 
1922, 307-312. ; 

(See tabled summaries by Dickson, No. 177, and by Freeman, 
No. 179.) 


~ 
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[To be read with Item 179, p. 322.] 


Percentage Coefficient 
Author Number | Differing Limits of Average | of Correla- 
Cases 10 Points Middle 50 Change | tion Between 
or More Percent Two Tests 
Perman -/5 savuinee ese icles « 435 Altay —3.3 to +5.7 4.5 .93 
Rugg and Colloton......... 137 aay? —2.3 to +5.6 4.7 84 
Garrison ies cis cls s eiclee tees « 468 085 (-2 to+4 ) 5.4 88 
(-—3 to+4 ) 
(—3 to+5 ) 
Muggy Us. iSeeesieesctree Saes 114 —1.2to +1.9 Eyal 95 


181. Gordon, Kate. ‘‘Some retests with the Stanford Binet 
Seale.’’ Jour. Educ. Psych., 18: 1922, 363-365. 


(See tabled summary by Dickson, No. 177.) 


182. Gray, P. L., and Marsden, R. E. ‘‘The constancy of the 
intelligence quotient—final results.’’ Brit. Jour. Psych., 17: 1926, 
20-26. 


Of a group of elementary pupils in a large city in England in 1920, 
44 were tested twice, 12 three times, and 23 four times, at yearly 
intervals. 

The Stanford Binet was used. All possible comparisons yielded 
616 pairs of tests. 

The central tendency of change is —1.3 points, and the P.E. of 
prediction is 5.5 points (based on the 616 pairs). ‘The r for all these 
pairs is .85; for 100 pairs after first retest .88 = .02; for 371 compari- 
sons at last test, .85 = .o1. 


183. Henmon, V. A. C., and Burns, Helen M. ‘‘The constancy 
of intelligence quotients with border-line and problem cases.’’ 
Jour. Educ. Psych., 14:1923, 247-250. 


Fifty-nine cases who had an average I.Q. of 76 on the Stanford 
Binet gave a correlation of .91 between the first and second tests and 
an average difference of 5.3 points. The median loss on retest was 
1.75 points. The median interval between tests was about two years. 


184. Hildreth, Gertrude. ‘‘Stanford Binet retests of 441 
school children.’’ Ped. Seminary, 33: 1926, 365-386. 


“Data for this study were obtained from Stanford Binet test rec- 
ords of 441 Lincoln School pupils. The number of tests for each indi- 
vidual varied from two to eight, the ages from 3 to 18 years, 1.Q.’s 
from 80 to 185 points. The tests were administered by 39 different 
examiners. . . . The median change in I.Q., when all possible pairs of 
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I.Q. were compared, was found to be .96; the limits of the middle 
50 percent of variation, —3.50 and 5.71 points, Q, 4.61 points. The 
coefficient of correlation for all pairs was .814.007.” The median 
time-interval between tests was slightly over one year. Little difference 
in the constancy of retests was found for increasing time-intervals up 
to three years; but retests made after three years showed slightly 
greater discrepancies with the first tests. 


185. Irwin, Elizabeth A., and Marks, L. A. Fitting the School 
to the Child. New York, 1924, 339 pp. 


Normal children yielded I.Q.’s within 9 points of original I.Q.’s in 
90 percent of 322 retests (p. 99), but only 53 percent of 30 neurotic 
children varied as little as 9 points. 


186. Johnson, B. J. Mental Growth of Children. Dutton, 
N. Y., 1925, 160 pp. 


Ten children first tested when three years old had an average gain 
in Stanford Binet I.Q. of 120 minus 112, or 8 points, with an average 
interval between tests of seven months. After an average interval of 
14 more months, the average I.Q. was 123. 

For 28 children first tested when four years old, the comparable 
figures were five points in 16 months, minus one point in 12 more 
months; for 15 first tested when five years old, plus five points in 12 
months, minus one point in 11 more months. For 11 first tested when 
six years old, the same average was retained at the first and second re- 
test (similar intervals). 

The writers feel that their data indicate “greater inconstancy” of 
I.Q.’s in the early years. 

Comment: Data on this point do not discriminate adequately be- 
tween the possible ‘instability’ due to nursery-school training and that 
due to unreliability of the tests. 


187. Kuhlmann, F. ‘‘The results of repeated mental reéxam- 
inations of 639 feeble-minded over a period of ten years.’’ Jour. 
Appl. Psych., 5: 1921, 195-224. 


The following average yearly decline in the I.Q.’s of children 7 
to 20 was found for each of four grades of feeble-mindedness: 


Decline Approximate I.Q. Range 


Borderline’. neva csea <p ar 2.19 points Over 74 
uM KapROVarss S Aly aio Hloigaise HBO 1.21 50 to 74 
Impbecilesey-c sire lepden sreescks 1.04 25 to 49 
Ieheyeh eG Aly canbpctom ae ais woke 37 0 to 24 


The I.Q.’s used in these computations were corrected for the faulty 
standardization of the scale. 2 

Of the idiots 99 percent changed less than six points a year; of 
the imbeciles 98 percent changed less than six points; of the morons 98 
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percent changed less than nine points; of the borderline 94 percent 
changed less than Io points. 

The author shows the data upon decrease of I.Q. with age to be 
the natural consequence of an assumption of a decrease with age in 
the yearly increment of mental growth. 


188. Poull, L. E. ‘‘Constaney of I.Q. in mental defectives, 
according to the Stanford Revision of Binet tests.’’ Jour. Educ. 
Psych., 12: 1921, 323-324. 


One hundred twenty-six defectives were tested and retested on the 
Stanford Binet at intervals of six months to three years. On the first 
test the ages ranged from 4 to 28 years, and the I.Q.’s from 20 to go. 
The average increase on the second test was 1.28 points, and the mid- 
dle 50 percent of changes extended from a decrease of 3.3 points to an 
increase of 4.8 points. 


189. Rosonow, C. ‘‘The stability of the intelligence quotient.’’ 
Jour. of Delinquency, 5: 1920, 160-178. 


(See tabled summary by Dickson, No. 177.) 


190. Rugg, H., and Colloton, Cecile. ‘‘Constaney of the Stan- 
ford Binet I.Q. as shown by retests.’’ Jour. Educ. Psych., 12: 1921, 
315-322. 


(See tabled summaries by Dickson, No. 177, and Freeman, No. 179.) 


191. Rugg, L. S. ‘‘Retests and the constancy of the I.Q.’’ 
Jour. Educ. Psych., 16: 1925, 341-343. 


(See tabled summary by Freeman, No. 179.) 


192. Terman, L. M. The Intelligence of School Children. 
Boston, 1919. 317 pp., esp. 188 ff. 


Some 315 children retested in the vicinity of Stanford University 
yielded 435 comparisons on the Stanford Binet. The intervals between 
tests ranged from one day to seven years, and the age at the first test 
ranged from three to over 15 (8 cases). The correlation for the entire 
group of retests was .933. When differences in retest were tabled by 
age at first test and by interval between tests, the interval seemed to 
have no efiect upon constancy. ‘The age at the first test seemed to 
have little effect upon constancy at most ages, but the group tested first 
between three and six showed slightly more variability than other 
groups on the retests. 

(See also the notes on Terman’s data in the tabled summaries taken 
from Dickson, No. 177, and from Freeman, No. 179.) 
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9. Effect of Coaching 


What are the limits of mental performance when training of a 
highly intensive and specialized nature is given? 


193. *Bishop, O. ‘‘What is measured by intelligence tests?’’ 
Jour. Educ. Research, 9: 1924, 29-38. 


Sixty-four pupils from Grades VII to X were divided into the fol- 
lowing groups: Groups A and B carefully matched from seventh and 
eighth grades on the basis of the first score on Otis Group Scale, Form A, 
chronological age, sex, apparent maturity, and habits of work; Groups 
C and D formed from ninth and tenth grades by similar pairings. 

A series of 10 lessons similar to the 10 pages of the Otis Test was 
prepared by the author. The first five lessons were taught to Group A, 
and the last five lessons to Group B. All lessons were taught to Group 
C, and none to Group D. About 150 minutes was spent by Group A 
and B pupils and “more time” by Group C pupils. 

After coaching, Group A showed “a total gain in scores of 376 points 
on the first five pages of the Otis scale as against 42 points on the last 
half.” This is equal to a median percentage gain of 40 percent on the 
first half and 6 percent on the second half. Corresponding percentage 
figures for Group B (which had been trained on the second half) were 
31 percent and 15 percent. Group C “made a total gain of 672 points, 
as against a total gain of 243 points” by Group D; 1t.e., there was a 
median percentage gain of 30 percent for Group C against 11 percent 
for Group D. 

Comment: It would have been desirable to report these gains in 
terms of some measure of dispersion instead of “percent of mean score” 
or total points gain. 


194. *Chapman, A. E. ‘‘The effect of school training and spe- 
cial coaching on intelligence tests.”? Forum of Educ., 2:1924, 
172-183. 


The following battery of tests was given in two forms to 300 
elementary-school pupils between II-o and 12-11: 

(1) Five arithmetic questions from Alpha 

(2) Ten from Burt’s reasoning 

(3) Five from Alpha practical judgment 

(4) Ten from Alpha number series 

(5) Five from Alpha analogies 

(6) Ten from Alpha disarranged sentences. 

After the first composite test, coaching was given I5 minutes a day 
for two weeks, during which period “the teachers dealt with the ques- 
tions in the test paper already set to their pupils, discussing with them ~ 
the methods of solving them.” 
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The §S.D. on the first composite test for a single age group was 9 
to I0 points. The average gain of the coached group was 9.4 and of 
the uncoached group was 2.3, leaving 7.1, or half a S.D., in favor of 
the coached group. 


195. DeWeerdt, Esther H. ‘‘The transfer effect of practice in 
related functions upon a group intelligence test.’? School and 
Soc., 25: 1927, 438-440. 


Forty-five fifth-grade children of a test group and 45 children of a 
control group were given the Illinois Intelligence Examination twice, 
12 days apart. The test group was given practice on 11 consecutive 
school days for an hour a day on the following features of intelligence 
tests: symbol-digit, addition, reading (Chapman-Cook speed test), can- 
cellation, multiplication, same-opposite, multiplication by substitution. 

The percentage gains upon the portions of the intelligence test after 
this training were as follows: 


Percent Gains 
Test Group | Control Group 
FATIRIO PICS mye AS caste eae ieke ote el cetera cts, oun ste. hs 115 68 


ATIbOMEeLIC problemsecne sey. crs cies 2 sie lls a=: 8 8 
Sentence) VOcabularviees cris alee celesie cecle to 26 13 
SUbstioUtion 1 Werte see dees e ee aeyese ees. 30 31 
Werbalkincentitye ert: tee ccrte ee on elsieis oes 13 16 
Arithmetic moenuity =. .)ec cei ss oc eelss sis 12 15 
SY NOMVM-ANLON VAs oie ee cleats ee elelsiaje cle > 48 21 

PA VCTAS OUD AlN nine laraiele «Miettie il claisy sieis cies tae sic 36 25 


The author concludes that analysis of results “indicates that the 
group intelligence test is a much more stable and reliable instrument 
for measuring group averages than its critics at times are willing to 
admit.” 

Comment: Since no measures of dispersion are published, the 
percentage gains are all the evidence we have. On the face of the 
figures, the advantage of the test group seems rather slight. 


196. *Odell, C. W. ‘‘Some data as to the effect of previous 
testing upon intelligence scores.’’ Jour. Educ. Psych., 16: 1925, 
482-486. 

(1) Out of 5283 high-school seniors, those who had had an in- 
telligence test before scored only two points higher on the Otis Self- 
Administering Test than those who had never had an intelligence test 


before. 
(2) Army Alpha was administered to 44 pupils of ages 11 to 16, 
“after which several weeks of practice and coaching upon material 
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similar to that obtained in the test was given, accompanied by occa- 
sional retesting with the several forms of Alpha.” Soon after the con- 
clusion of the experiment (“a few days or even weeks later’) Army 
Alpha was repeated and again “more than two years later.” Because 
of uncertainty as to adult mental level, I.Q.’s were computed on three 
bases: (a) exact C:A.; (b) 14 years C.'A., and (c) 16 years C.A,, 
with the results here shown: 


Basis of Computing Adult I.Q.’s 
Exact C.A.| 16 Years 14 Years 


Beforestrainingas ger ecko ces peice 119 119 120 
Soon aver wralming see ss iisreere selec 154 154 157 
Overs2iyrs alters sateonte cre tana aie eee 115 118 130 


Thus, the median I.Q. was slightly less at the end of two years than 
at the beginning by the first two methods, and only Io points greater 
when using the 14-year basal age. The increase shortly after training 
is 35 to 37 I.Q. points, depending upon the basal age employed. 

Comment: ‘The study shows unquestionable immediate effects of 
training. Effects after two years are not clear, since Alpha norms for 
unselected children in their teens are not available. 


197. *White, W. ‘‘The influence of certain exercises in silent 
reading on scores on the Otis Group Intelligence Test.’’ Hlem. 
School Jour., 23: 1923, 783-786. Also in Ed. Admin. and Super- 
vision, 9: 1923, 179-182. 

Various groups were given the Otis Test twice (at intervals of 36 
to 59 days). During these intervals, reading drills, constructed to in- 


crease speed of reading, were given to all except 59 eighth-grade pupils 
who served as a control group. The following results were tabulated: 


N Group Test 1 Test 2 Average 
Gain 
85 | 4th, 5th, and 6th grade............. 72.3 80.9 8.6 
AS Sth grade veterse coe cee theists lohal er ecoke consi 120.2 137.0 16.8 
D4at Sthigrad Ge racer ticseioct-/ee clower oie te 99.4 122.6 23.2 
GOP ledtbigradewereerens ce nek seve cis ie oie terers 96.2 121.2 25.0 
HOveControl Sthyorad ea wtawciee tani ate seven. 110.0 119.6 9.6 


The author finds also, in further statistical treatment, a consider- 
able inverse relation between percentage of increase and the Otis index 
of brightness, from which he concludes that abilities have been equal- 
ized through training. 
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Comment: It is unfortunate that no measures of dispersion are 
given, but the net gain of drilled groups seems to be six months to a 
year of normal growth. 

The inverse relation between percentage increase and index of 
brightness might be due simply to the fact that in the duller groups 
the numerators from which the percentage of increase is calculated are 
smaller. Growth is not often found to be a multiplying factor times 
“present level” and could hardly be expected to be such a factor here. 


II. AcHIEVEMENT 
10. Family Resemblance 


This topic is significant in the same way that family resemblance 
in intelligence is significant, but the data are no more crucial in 
this case than in the other. It is worth pointing out that here, as 
in the case of the intelligence studies, not a single one of the in- 
vestigations we summarize reports correlations which have been 
corrected for attenuation. Hence, it should be borne in mind that 
the true resemblances are in practically every instance larger than 
the figures, based as they are upon scores from imperfect measuring 
instruments, would indicate. 


198. Cobb, Margaret V. “‘‘Preliminary study of the inheri- 
tance of arithmetical abilities.’’ Jour. Hd. Psych., 8:1917, 1-20. 

Fight families including both parents and children over 14 (of 
which there were 20) were given the Courtis standard tests. The scores 
were measured in S. D. units derived from tests on 200 university under- 
graduate and graduate students. 

Correlations between scores were: 


Child and | Child and Child and 


Mid Like Unlike 

Parent Parent Parent 
FN ACIDION Sere eres eine colons alere se ste 28 54 05 
NuUbtracclOM srt orate tt steel teteetes 01 27 —.11 
Maaltiplication verre tet otters ts nein. tres <> .55 79 .24 
PDI VISIOU MR eI Ged ole ort Si euctaretety ene Ball .44 .08 
Copying higuresceeeen tee eee erea steel: 45 .65 13 
AV CT AS OUR PN ie ticle elaieleielevsususle pei stel ice 32 54 08 


Comment: ‘The author attempts to interpret the correlations of 
the child’s scores with those of the parent most like, and most unlike 
himself, upon the theory of Mendelian segregation. However, the as- 
sumption that graduated traits, such as arithmetic abilities, are due to 
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single Mendelian genes seems unwarranted, and hence the significance 
of the particular treatment in question is not clear. 


199. Dexter, Emily 8. ‘‘On family resemblance beyond the 
first degree of relation.’’ School and Soc., 19: 1924, 501-502. 


Scholarship averages of 201 cousin pairs who had attended the Uni- 
versity of Wisconsin correlated .21 + .04; of 64 cousin pairs who had 
attended Agnes Scott College, .19 = .08; of 96 pairs of uncles and aunts 
versus nephews and nieces at Wisconsin, .06 = .07; of 18 pairs of aunts 
and nieces at Agnes Scott, .15 = .16. 

When tested by two or more tests out of Haggerty, Detroit, N. I. T., 
Stanford, or Herring-Binet, the average scores of 135 pairs of cousins 
in a Madison elementary school correlated .22 = .06. 


Comment: It is a questionable procedure to lump results from 
different types of tests in a single correlation, as was done in the last 
treatment. The method of obtaining average scores for this treatment 
is not explained. 


200. Earle, E. L. ‘‘The inheritance of the ability to learn to 
spell.’’ Columbia University Contrib. to Phil., Psych. and Educ., 
11: 19038, No. 2, 41-44. 


An unstandardized spelling test was given to I91 pairs of siblings 
in a public school and 196 pairs in a parochial school. The author 
thinks the latter represents a more unselected group in the upper grades. 
The mark assigned to each child “was his distance above or below the 
average for his grade and sex in terms of the average deviation of 
that grade and sex as a unit.” 


Product-Moment Correlations 


Public school r N 
PR TOLMETADIOLNED 1215, ovate siecle cesta ore a's aie 22 45 
MDISLEL=GISEOR. vrayata Leloies isis ako steloltnsis scare ake tele ae 62 
Broder sisterent <) axes civerce sielawlete steve s .22 84 
Genel alismters has idtvt.c shore a e's wisinto o aicle'e 6% 25 I9I 

Parochial school r N 
BrOtMersDrOwneh vitesse sores aaie cies teiareeton .56 48 
SISEETESISLOL ee pelenstersieielshaneisloneneederaielsiaie cists .48 44 
IBTothereststeturnmmeaiererrestcis srl sieclerstele .50 104 
GEES AN rete crates slate e's ots asin cvietertle sia/eiole'a as “SE 196 


Comment: ‘The study has historical significance, but the method 
of computing the scores makes precise interpretation impossible. 

201. Gowen, J. W., and Gooch, M. ‘‘The mental attainments 
of college students in relation to the preparatory school and ~ 
heredity.’? Jour. Educ. Psych., 17: 1926, 408-418. 
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The following correlations between sibling pairs graduated from 
the University of Maine since 1921 or in college during 1922-1923 were 
based upon marks in college courses: 


Subject r Rae N 
ROUSSE MNT tats aiee erst reo e cls .04 279 
Ciieniisttyecuerc ce: cecindes fee ae .16 05 159 
PAID iain aoe vale en saiiriie's von .16 .06 103 
Analytical geometry ........... (22 .07 79 
Wdvancedsbrench 2.0 s..ce.s 00.10 10 39 
Elementary German... ....5...5 .41 sI 24 


The average for brothers with brothers was .187 for the first four 
subjects (the only like-sex comparison with sufficient numbers for com- 
parison). Since the average r for all sibs on the same four subjects 
is .I7I, it seems justifiable to lump the sexes. 

202. Hildreth, Gertrude H. The Resemblance of Siblings in 
Intelligence and Achievement. New York, 1925, 65 pp. (Teachers 
College, Columbia University, Contributions to Education, No. 
186.) 

(See summary No. 11.) 


203. Huestis, R. R., and Otto, T. P. ‘‘The grades of related 
students.’’ Jour. of Heredity, 18: 1927, 225-226. 

“Grade-point” scholarship averages for the sophomore year at the 
University of Oregon were secured for a hundred pairs of siblings. 
The correlations obtained were: 


r Pe N 
ISLETS avalos here wicisretee aisle dao os 61 .07 38 
ISTOULCIS Meters eiels dees va tcae 5 she 74 .06 26 
iBrotherssanudasisters sprees. sci $ 04 iit 36 


A possible reason suggested to account for the higher values found 
for the like-sex coefficients than for the brother-sister coefficient is that 
“brothers are more liable than their sisters to be partially self-supporting 
and more liable to be interested in athletics.” 

Comment: The large size of the like-sex coefficients reénforces 
the author’s suggestion that external conditions have affected the sibling 
similarity, since a correlation explicable by heredity alone would not 
have a value much above .50. 

204. Peters, W. ‘‘Ueber Vererbung psychischer Fahigkeiten.’’ 
Fortschr. der Psych., 3: 1915, 185-382. 

This study deals with the final sets of elementary-school marks of 
three generations of village and country people. Records were obtained 


for 1162 children, 344 pairs of parents, 177 grandparents, and II great- 
grandparents. The author attempted roughly to equate on a single scale 
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the marking systems of different localities. Marks were also equated 
for sex differences. 


The following tabulation shows that, even when parents have the 
same marks, the children have better marks when grandparents are 
highest. 


Marks of Marks of Marks of 
Parents Grandparents Children 
I-I 1.25 1.19 

1.94 1.71 
2-2 2.09 1.97 
2.70 2:23 
3-3 2.38 2.00 
3.50 2.70 


Taking the individual marks in four school subjects and construct- 
ing a fourfold table for mid-parent and offspring by referring each 
mark to the average of marks in all subjects, Yule’s coefficient q was 
-455. Pearson’s contingency coefficient .31, his mean-square contingency 
coefficient .36, and his fourfold r .37. Other q coefficients (based on all 
marks, but not on all marks averaged for each person), were: 


Bathérs.and* childrens.sa wane asics fats winks 34 
IMotherkandychuildrenteei cemeteries cra .48 
Mid-parentrandacontman nvr etn ie meee arent .40 
Mid-parentVand)datohteta:a..1. acetic 51 
Fatherraudusoties certo ssvaecte cites an aatierens BE 
Motherkand@sontiac: cscs otlon urs Setgtettett 42 
Fathersand) davctiter 0... rceaia ne oe eek 36 
Mothersandsdanchtentzni nme ce atvetetecrre 55 
Child and average of 4 grandparents...... .37 (151 children) 
DIDI OS Rae Seall ce ee ep ecete cee 53 
IB TOLMELS Meercrs sesia.e oteers sit eset eraretetsts otro mice 58 
SSISLORS fey) apts wiclapncew ie’ adios Sracehe ore Cane ean ae ope 


Data are also given upon correlations between siblings on a few 
simple mental and motor tests. These are not summarized here. 


Comment: ‘The facts of this study have historical importance and 
establish unquestionably some family resemblance in school accomplish- 
ment. But the coefficients of family resemblance can be taken as little 
more than suggestive, because (1) school marks are always unreliable 
as individual measures, (2) the correction for varying standards of 
marking in different schools was only approximate, (3) Yule’s coefficient 
generally gives values higher than a product-moment r. 


205. Schuster, E., and Elderton, Ethel M. ‘‘The inheritance ~’ 
of ability.’’ Eugenics Lab. Memoirs, 1: 1907, 1-42. 
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(1) Evidence of Oxford Class Lists. Number of fathers and sons 
and brothers who obtained first, second, third, and fourth-class honors, 
pass degree, and no degree were tabulated. All possible pairings were 
used. 

Contingency coefficients for several thousand pairs of fathers and 
sons completing their work during the 19th century clustered around 
.30; for 4000 pairs of brothers, contingency coefficients were about the 
same, but four-fold correlations were approximately .40. 

(2) Harrow and Charterhouse lists. “Position” in the school was 
found for pairs of brothers at equivalent times in their school careers 
in Harrow, and in Charterhouse. Between 1000 and 2000 pairs entered 
into various treatments. 

Harrow pairs dated from 1858 “onwards.” 

C, varies from .37 to .44; mean .395 
C, varies from .38 to .46; mean .415 
r varies from .25 to .49; mean .398 


206. Starch, D. ‘‘Inheritance of abilities in school studies.’’ 
School and Soc., 2: 1915, 602-610. 


(1) In 63 families each having two or more children in a certain 
school (and yielding 111 sibling pairs when all possible pairs were used) 
the rank-order correlation for school marks during one year was .52. 

(2) Standard tests given to 57 pairs of siblings in two elementary 
schools yielded the following correlations: 


r N 
See eeeees ols ld as 5'5'G 5 sista wee wea'elg 6 .21 57 
DRG aii Renal ate ar aye! ovecs's fa) 01,60 8, oteiers le ,crs .49 57 
Wisitinom (Speed heat aes leisietelersieisrstetes ote .18 24 
Wikiannnte (Cahier) 5b al BENE neu Biees Gd .06 24 


Average marks over a period of three to eight years computed for 
38 elementary-school children of 11 families gave the following cor- 
relations for sibling pairs: 


r N 
BA PIVAIMIGUIC rote Coase sis'e sid ro/ sictale ele aie aig.s 32 54 
S pellitiomeme yer creo etetee clas «creel nreteie ers oa 54 
UR Cal Gi 11 up temete me ttereromiore aiciats ieee sileteccl er siere\s hil 54 
LARTER. 6 Caridad o auc OOeD oO Boor 24 54 


207. Starch, D. ‘‘Similarity of brothers and sisters in mental 
traits.’’ Psych. Rev., 24: 1917, 235-238. 


Eighteen pairs of adult siblings, aged 19 to 32, at the University 
of Wisconsin were tested twice on two different days on a battery of 
achievement tests and simple mental tests. The average of the sepa- 
rate correlations for all the achievement tests (of which there were 
10) was .42, of the mental tests (including memory, cancelling a’s, can- 
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celling geometric forms, and tapping) .38. The correlation based on 
ranks in all tests combined was .73. 

The author suggests as an argument for heredity that resemblance 
is apparently no greater in traits affected by school work than those not 
so affected. 

Comment: It was not stated how the ranks in all tests were com- 
bined for the correlation .73. 

Since the group used was undoubtedly highly selected, and since 
their variability is not reported, the actual values of the correlations 
can be taken only as suggestive. 


11. Contribution of Intelligence to Achievement 


The countless studies showing correlations between intelligence 
and achievement varying from close to zero for handwriting up to 
-70 or .80 for subjects like reading have not been summarized here 
because of limited space, and because they provide no means within 
their own data for determining whether the observed correlations 
are due to the influence of intelligence upon achievement or of 
achievement upon intelligence. The few studies contained in this 
section deal chiefly with the limitations upon achievement set by 
mental levels of various grades. 


208. Burt, C. Mental and Scholastic Tests. London, 1921, 
432 pp. 


(See Summary, No. 125.) 


209. Courtis, S. A. Why Children Succeed. Detroit, 1925, 
271 pp. 


A multiple correlation computed for measures of 165 boys between 
10.5 and 13.5 upon the Stanford Achievement Test with two group 
intelligence tests, number of semesters of school attendance, and chron- 
ological age was .g1. The standard error of estimate was 5.46. 

Since this 5.46 is about 10 percent of the average Stanford Achieve- 
ment score of the pupils, the author concludes that the factors under 
discussion “account for 90 percent of the changes in children.” Since 
“the standard deviation of the changes in the achievement test scores 
on a retest after an interval of two days was a little under 7 percent 
of the mean score,” he concludes “that all the remaining factors which 
influence a child’s score have a combined general effect which is not 
more than 3 percent of the total score.” 

Comment: The statistical methods by which these conclusions. are. 
reached cannot be defended. The division of total score by standard 
error of estimate to secure “proportional contributions” is meaningless, 
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since (a) there is no inherent relationship between a mean and a stand- 
ard error of estimate; (b) the mean of a test is an arbitrary distance 
from zero and can be raised or lowered at will, by inserting or deleting 
items easy enough to be passed by every pupil; (c) the sum of the 
separate standard errors of estimate for independent factors is inher- 
ently greater than the standard deviation of a criterion itself; (d) 
there is nothing in the data themselves to warrant the assumption of 
a causal influence of the factors in question upon achievement. The 
study contributes nothing beyond establishing certain correlations be- 
tween variables. 


210. Courtis, S. A. ‘‘The influence of certain social factors 
upon scores in the Stanford Achievement Tests.’’ Jour. Educ. Re- 
search, 13:1926, 311-324, and 14: 1926, 33-42. 


Using data similar to those enumerated in the abstract No. 209 of 
Why Children Succeed, the author modified the statistical treatment 
originally applied and reached the conclusion that environmental factors 
account for 22 percent of the variation in school achievement of chil- 
dren 10.5 to 13.5 years old. This percentage value was arrived at by 
considering the standard error of estimate of the Stanford Achievement 
scores of 443 children after a multiple correlation of achievement with 
intelligence, age, and semesters spent at school had been computed. The 
standard error of estimate was found to be 44 percent of the total 
standard deviation of achievement scores; and since “chance variation” 
itself amounts to approximately 22 percent, 44 minus 22, or only 22 
percent of the variation “is the maximum that can be predicted from 
environmental factors.” 

The author also attempted to appraise the effect of Sunday-School 
attendance upon the achievement level of the pupils and reached the con- 
clusion that for children equal in intelligence, age, and school attend- 
ance, “The effect of Sunday-School attendance upon children coming 
from religious homes is to raise their Stanford scores approximately 
5 percent (3.74 points) of the total value.” 

Comment: ‘The effect upon achievement attributed to environment 
is said to be 22 percent in this study as compared with 3 percent in the 
previous study. But the modified treatment used here appears to in- 
troduce new errors in the underlying statistical theory. Standard 
deviations and errors of estimate cannot be added and subtracted to 
yield the type of interpretation sought by the author, because standard 
errors of estimate for various unique factors do not combine by simple 
addition to give the total standard deviation of a group. 

The method by which the second conclusion is reached is not clear 
throughout, but some kind of environmental force appears to be at least 
a possibility. It is not possible to say whether this is religious or other- 
wise, since Sunday-School attendance may or may not be correlated with 
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other factors in environment which are actually responsible for the 
relationship found. 

Commenting upon the small percent contributed by the Sunday- 
School, the author makes the statement, without clear foundation, that 
his results prove “not that the powers of religion are a negligible quan- 
tity, but that under present conditions it is only the stray individual 
who actually makes use of the rich gifts religion has to offer.” 


211. Dickson, V. EK. ‘‘ What first-grade children can do in 
school as related to what is shown by mental tests.’’ Jour. Educ. 
Research, 2: 1920, 475-480. 


Twenty-five children with M.A.’s from three to six, and border- 
zone I.Q.’s, had had from two to four terms of previous schooling. After 
two and three-quarters terms under an especially good teacher in a spe- 
cial class, only six of the 25 could read in an easy primer, and these 
could read less well than the pupils of a normal class who had had only 
three-quarters of a term of instruction. 

Of 29 children with slightly better I.Q.’s, and M.A.’s from four to 
six and who had had one to three terms of schooling, only seven learned 
to read approximately as well as the average high-first pupil after two 
and three-quarters terms of special work. 

By way of contrast, only five out of 42 children testing normal or 
above failed at the end of the first grade; and these failed, according 
to their teacher, because of irregular attendance or excessive timidity. 

Of 30 children who tested below 6 years M.A. and who had had 
one to two terms of previous schooling, only two were promoted to the 
high-first grade after two terms of special work. 

Comment: ‘The study shows clearly the limitations upon achieve- 
ments set by intelligence, but does not provide a means for distinctly 
separating nature from nurture. 


212. Franzen, R. H. The Accomplishment Ratio. New York, 
1922, 59 pp. (Teachers College, Columbia University, Contribu- 
tions to Education, No. 125.) 


Of a group of 200 children serving as subjects for the author’s 
experiment, 48 took the following battery of achievement tests four 
times during a period of two school years: 

1. Woody-McCall Mixed Fundamentals (given only twice) 

2. Thorndike Reading Scale Alpha 

3. Thorndike Visual Vocabulary Scale, Az 

4. Kelley-Trabue Completion Exercises in Language 
They were also tested by the Stanford Binet at the beginning of the 
experiment. ree 

Pupils were classified in each school subject on the basis of the 
achievement tests; their abilities in these subjects were pushed, and 
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promotions were made as often as necessary to maintain proper classi- 
fication. Correlations of I.Q. with achievement rose from November, 
1918, to June, 1920, as follows: 


LOvwith® Vocabulary Quotient... vo. .ccescceceedeee 272; tO Or 

ft Om waters Caciney OUOtIeHt tim cates «ces ehetace.e .64 to .79 

I.Q. with Kelley-Trabue Completion Quotient....... .63 to .84 

I.Q. with Arithmetic Quotient (during second year 

CRETE E SOC) iin Wire ees stork Sire aera VOR oa es .46 to .73 

The theoretical limit of these correlations (based as they are upon 
fallible tests) is about .95. 

The author concludes that “intelligence is far and away the most 
important determinant of individual differences in product.” 


Comment: It is highly significant that better prediction of school 
achievement from intelligence could be obtained for results two years 
after the Binet tests than for results at the time of the Binet tests— 
under the condition that opportunity was “equalized” during the two 
years. This indicates that the Binet tests are probably measuring a 
factor in capacity to do school work which is to some degree independ- 
ent of the results of school work. 

One possibility not made clear in the study should be known before 
the results can be accepted without question. This concerns the relative 
effort that was made to push abilities before and after the school achieve- 
ment of individual pupils became commensurate with their intelligence. 
If there were a difference between motivation before and after that 
point, correlations might have been due to that difference rather than to 
“equalized opportunity.” 


213. Merrill, Maud A. On the Relation of Intelligence to 
Achievement in the Case of Mentally Retarded Children. Balti- 
more, 1923-1925, 100 pp. (Comp. Psych. Monographs, 2: No. 10.) 


Tests given to several hundred children, including subnormal, un- 
selected, and gifted pupils, offered a basis for the following conclusion: 
“When retarded children are matched mental age for mental age by 
normal children of average I.Q., the normals attain a slightly higher 
average achievement score on a battery of educational tests, including 
tests of reading, writing, arithmetic, and spelling. The retarded pupils 
start a little ahead of the normals at M.A. seven, but fall behind as the 
comparisons proceed up to M.A. eleven, except on the writing test, on 
which they keep above the normals up to eleven and then fall only very 
slightly below.” 

At M.A. nine the retarded excel the gifted by about a year of E. A. 
They do not at M.A. eleven, however. At this level both the gifted 
and the retarded are from two to ninth months below this M. A. on edu- 
cational subjects, except reading, on which the gifted are about five 
months in advance of their C. A. 
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12. Effect of Length of School Attendance 


Investigations in this field are significant when neutral results 
are found; but when plus relationships are found, the possibility 
of selection cannot usually be satisfactorily measured and elimi- 
nated. Studies using controlled experimental methods over a term 
of years are needed. 


214. *Corman, Oliver P. Spelling in the Elementary School. 
Boston, 1902, 98 pp. 


When spelling was eliminated from the curriculum for several hun- 
dred Philadelphia school children during three years, it was found that 
the children did as well in spelling measured on a composition test, on 
lists of dictated words, and on a 15-minute test of spontaneously writ- 
ten words as the medians for the remaining school children of the city 
who had had the ordinary amount of drill. On a term examination 
paper required of all the city schools the undrilled children ended up 
four points below the city medians, but this difference was thought 
small as compared with fluctuations of 40 points in the medians of vari- 
ous classes of the same grade throughout the city. 

Tabulating the time per day devoted to spelling in 11 schools (which 
varied from Io to 40 minutes) it was found that “no relation between 
[time allotments] and the spelling percentages seems to obtain.” 

The conclusion is that “an influence, the suspension of whose opera- 
tion for three years is not plainly manifested—is of so little importance 
as to be practically negligible.” 

Comment: Possibly, the slightly inferior performance of the ex- 
perimental schools on the term examination may have been due to the 
type of the test, which was one set by the superintendent’s office. It 
may have been based exclusively upon words actually studied in the 
text book. " 


215. Gordon, Hugh. Mental and Scholastic Tests Among Re- 
tarded Children. London Board of Education, Ed. Pamphlets 
No. 44, 1923, 92 pp. 


(See summary in section on schooling and intelligence, No. 127.) 


216. Kelley, T.L. The Influence of Nurture upon Natwe Dif- 
ferences. New York, 1926, 49 pp. 


The author wishes to find out whether unevenness in school abili- 
ties tend to increase or decrease under the influence of the schooling 
process. Scores upon the eight tests from the Stanford Achievement 
battery were used in comparisons made between eight-, eleven-, and_ 
fourteen-year-olds, all ‘at age’ for their grades. Assuming that the con- 
tribution of nature to the unevenness, or ‘idiosyncrasy,’ of an individual 


? 
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upon any pair of school abilities remains constant throughout life, and 
that the influence of nurture upon idiosyncrasy is proportional to the 
rate of growth of school abilities, the author sets up equations for 
establishing the relative contributions of nature and nurture to adult 
idiosyncrasy. 

He reaches the result that the only school abilities of the battery 
whose comparisons with the other abilities show an average increase of 
idiosyncrasy due to nurture are science information, history and litera- 
ture information, language usage, and spelling. Nurture decreases the 
average “innate distinction” of paragraph meaning, sentence meaning, 
word meaning, computation, and arithmetic reasoning—each compared 
in turn with all of the other seven abilities. 

Kelley’s technique for determining idiosyncrasy is also applied to 
the Stanford Achievement scores of a group of eight-year-old gifted 
children. The results indicate that the idiosyncrasies for all the pairs 
of tests are in gifted children reduced during the three years between 
age eight and age eleven. 

The conclusion is drawn, aed its implications deplored, that school- 
ing more frequently tends, in both normal and gifted groups, to level 
talents which are natively distinct than to accentuate idiosyncrasies. 

Comment: The author admits that the study is based upon assump- 
tions some of which are in need of corroboration. The following ques- 
tions would apparently need to be answered before the conclusions 
could be considered as firmly established, particularly in the precise 
numerical form in which they are presented: 


1. Would idiosyncrasies surely remain constant throughout life 
if it were not for the influence of nurture, or might some changes in 
unevenness occur that were due to nature or to different rates of devel- 
opment of different abilities? 

2. Is the effect of nurture at any given time proportional to the 
rate of mental growth? 

3. Are the proportions of children promoted or failed in school at 
different ages real functions of the relative variabilities of children of 
these ages in school abilities? 

4. Are standard (z) scores adequate measures of children whose 
ages span an entire year if the norms used are those for children just 
six months away from a birthday—especially if the children are an age 
at which rapid growth is taking place, and more rapid growth in some 
abilities than in others? 


217. Odell, C. W. ‘‘The effect of attendance upon school 
achievement.’’ Jour. Educ. Research, 8: 1923, 422-482. 
Several thousand school children from Grades I to VIII were given 


intelligence tests of various types and a battery of standard achievement 
tests, once at the beginning of a semester and again towards the close. 
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Their school marks and percentage attendance during the semester were 
also noted. 

The correlation of attendance with increase in average achievement 
for 13,700 cases was .o1; with average school mark (22,800 cases), .14; 
with intelligence quotients (3,800 cases), .06; and with attendance for 
the following semester (5,600 cases), .19. 

Comment: The median attendance was reported as 94 percent. 
Less than 2 percent attended as little as 81 to 90 percent of the time, 
and only a fifth of 1 percent attended 10 percent of the time or less. 
Thus, the spread was so slight that differences in effect on achievement 
had little chance to show up. ; 


218. *Rice, J. M. ‘‘The futility of the spelling grind.’’ Forum, 
23: 1897, 163-172, 409-419. 


On a series of dictation tests and composition tests in spelling given 
to over 25,000 elementary-school children a table is published of aver- 
age scores of classes in different cities with respect to time allotted to 
spelling. No r’s were computed, but the author believes his data prove 
“that the results obtained by 40 or 50 minutes’ daily instruction were 
not better than those obtained where not more than Io or 15 minutes 
had been devoted to the subject.” 


Comment: By inspection, the conclusion drawn from these tables 
seems justified. 


219. Snedden, D. ‘‘A note on the relationship between the 
number of months of study and proficiency (geometry).’’ Jour. 
Educ. Psych., 17: 1926, 57. 


Among applicants to Cooper Union night courses 441 boys were 
given Parts 1 and 2 of the Hawkes-Wood Placement Test in Plane 
Geometry. They noted on their test papers the number of months they 
had studied plane geometry. 

The correlation of months studied with score was .22 (or .24 if 
corrected for the unreliability of the test). The mean number of 
months geometry was studied was 9, with an S.D. of 3.6. 


Comment: No allowance was made for the amount of time elapsed 
since the boys studied geometry. 


220. *Terman, L. M. Genetic Studies of Genius, I. Stanford 
University, 1925, 648 pp. (Pp. 304-305, ‘‘The influence of attend- 
ance upon educational accomplishment.’’) 

The correlations between number of semesters of school attendance 
and achievement quotients for tests in the Stanford Achievement bat- 


tery as computed for 109 gifted children between Io and 11 years old, 
were as follows: 
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r Lee 
Mpelligteer. cts es te ccs uy dé raee eee s .04 .07 
Fei CQ Uti tt 1 gh s aiaae tore cc aoren oe sedis a .00 07 
ECAC Siete sleuo cie'sie:s\aisie wa Satan) Gals oe —.OI .07 
MN TIL IETECT Ochs do's. se csn crduare diese eve Bias «his 13 .06 
Cees eee et ersie otaea.a¥ere: scala wileo.o Weve Seavec ye .O1 .07 

On the other hand I.Q. correlated with these subjects as follows: 

r P.E. 
SU CUMCM OME Pes. tk o-s  cSiercts coke cameos [33 .07 
LOTS GIy se orci sishsle-aesele sie: cu, o16 oroners .46 .05 
PRCA CII OME ania y) cols 8 oa-d auctete ee ere 34 .06 
PA TAC IME LIC ra et cesta ce aha sit wap sy bos oot .26 .06 


For this group, school attendance ranged from two to six and one- 
half years, with an S. D. of .86 years. 


13. Effect of School Conditions, Size of Class, or 
Teaching Ability 


There is little material available upon this topic, but what 
there is runs contrary to tradition. Several new studies published 
in this Yearbook deal more extensively with elements of this field 
than most of the studies summarized here. 


221. Almack, J. C. The Adaptation of the School Bulding to 
a Program of Educational Efficiency. Stanford, Ph.D. disserta- 
tion, 1922. (‘‘The relation between class size and efficiency in in- 
struction—an experiment,’’ pp. 31-41. Also in Jour. N. E. A, 
12: 1923, 107-109.) 


A 5B class of 20 pupils was taught in two sections of 10 each dur- 
ing the winter quarter. These were united during spring quarter. A 
5A class of 30 pupils was united during the winter, but divided into two 
groups during the spring. A 7B class had one section with 11 pupils 
and one section with 35, and both sections were kept constant during 
the two quarters. The Monroe Silent Reading tests were given to all 
pupils at the beginning and end of the winter quarter and at the end 
of the spring quarter. The Monroe Revised Diagnostic Arithmetic 
tests were given to the fifth, and the Woody-McCall test to the seventh 
grade. Student cadets were assigned to the different classes and told 
simply to “teach as much as possible.” 

A slightly larger percentage of children showed achievement gains 
in the large classes than in the ‘small, but the difference was so small 
as to be ambiguous in the light of experimental conditions. The classes 
were not equated for initial achievement, excellence of teaching, etc. 
The author concludes: “Class size is not an important factor in deter- 
mining efficiency in teaching as measured by standard tests.” 
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222. *Breed, F. S., and McCarthy, G. D. ‘‘Size of class and 
efficiency of teaching.’’? School and Soc., 4:1916, 965-971. 


After 82 classes from Grades III to VII were drilled in spelling 
for 20 days on 80 words, large classes with an average enrollment of 
45 surpassed small classes with an average enrollment of 27, but the 
percent improvement was only about 2 percent higher in the large 
classes than in the small classes. 

Efficiency, in terms of percent improvement, showed a slight in- 
crease with increment in class size up to classes of 45-49, at which point 
efficiency slightly decreased (for classes of 45-49 and classes of 50-56, 
the largest classes used in the study). 


223. Crabbs, Lelah M. Measuring Efficiency in Supervision 
and Teaching. New York, 1925, 98 pp. (Teachers College, Col- : 
umbia University, Contrib. to Education, No. 175.) 


As a criterion of teaching efficiency in 64 rural classrooms and 20 
urban classrooms, the increase in accomplishment ratios of the 1100 
pupils comprising the classes was taken. Merit rankings were also 
obtained upon the teachers from supervisory officials, and to the rural 
teachers two forms of the Steele-Herring Test of Professional Infor- 
mation (each involving two hours of testing time) were given. 

The change in accomplishment ratio (on a combination of reading, 
arithmetic, spelling, composition, and penmanship) was measured for an 
interval of 12 months in the urban schools, and six months in the rural. 
The Herring-Binet or-the National were the intelligence tests used. 

The mean correlations of the changes in accomplishment ratio of 
five of the nine school subjects with the other four subjects were 
low, viz: 

Reading Arith. Spelling Comp. Penmanship 
TUE D AIDC AT she ofa dict cre al xO .26 14 ay, 28 
BRUT cl esate iin an oicter tae Ses .10 .10 .20 .00 

The author suggests that the low correlations between A. R.’s for 
different subjects may be due to the difference in emphasis of differ- 
ent teachers upon different subjects, yet calls the A. R. method “indis- 
putably the most valid method yet devised for ascertaining the real 
efficiency of the teacher.” 

For rural teachers the correlation between A. R. measures and esti- 
mated efficiency was .33; for urban teachers —.25. 

For the rural teachers the correlation between composite score of 
two Steele-Herring tests and A.R. measure was .05; but estimated 
efficiency with Steele-Herring correlated .41. 

Practically no correlation was found between intelligence level 
or initial A. R.’s of the classes and their A.R. gains; so the author 
assumes that the A.R. measure of teaching efficiency need not make 
allowances for I.Q.’s or initial A. R.’s. The conclusion is drawn that 
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there is no relationship between what teachers know about educational 
ideas or between the supervisor’s estimates of teaching ability and the 
objective A.R. measures of efficiency. 


Comment: ‘The extremely low correlations between A. R. gains on 
the five different subjects calls into question the validity of this measure 
of teaching efficiency. These low correlations, coupled with the fact 
that the A. R. compounds the errors of measurement both of intelligence 
and achievement batteries, and the fact that only six months to one year 
were allowed the teachers to bring about significant changes in their 
pupils, suggest that the author’s conclusion is not thoroughly established. 


224. Edmonson, J. B., and Mulder, F. J. ‘‘Size of class as a 
factor in university instruction.’’ Jour. Educ. Research, 9: 1924, 
1-12. 


Forty-four members of a large class in education (109 university 
students) and 43 members of a small class (45 students), who were 
equated for intelligence on the Army Alpha Test, averaged the same 
on the combined results of quizzes, mid-term examinations, and finals. 
The same instructor taught both groups and the same assistant graded 
their papers. 


225. *Gowen, J. W., and Gooch, M. ‘‘The mental attainments 
of college students in relation to the preparatory school and 
heredity.’’ Jour. Educ. Psych., 17:1926, 408-418. 


Results on nine hundred twenty-seven University of Maine gradu- 
ates are reported. Only high schools represented by two or more Uni- 
versity of Maine graduates between 1913 and 1921, inclusive, were con- 
sidered. “Large high schools” had an average daily attendance of 100 
or more, and “small high schools” less. Data are included, also, con- 
cerning pupils from private academies. 

Following is a table of freshman average grades: 


Advanced Elem. 
Schools English Chem. Algebra Anal.-Geom, Frenoh German 


Mean P.E.m| Mean P.E.m| Mean P.E.m| Mean P.E.m| Mean P.E.m| Mean P.E.m 


Large high...] 82.6 0.3 | 80.1 0.4 | 85.4 0.5 9 0 
Smack ee SesO OO ndaloce OL eOnn Olek) hale (9S. Se Le OMT ei ertetcteteeels 
Academies....| 80.5 0.5 | 79.5 0.6 | 78.3 0.9 ree i 


The authors conclude that “in general . . . it appears that the size 
of the high school has practically no relation to the average college 
marks which its pupils later receive.” 

226. *Stevenson, P. R. Class Size in the Elementary School. 
Ohio State University, 1925, 35 pp. (University Studies, 2: No. 10.) 
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In this experiment 26 pairs of second-grade classes, 20 pairs of 
fifth-grade classes and 16 pairs of seventh-grade classes were used. 
“Small classes” ranged in size from 16 to 32, “large classes” from 32 
to 54. By shifting pupils into large or small classes at the end of the 
first school semester, comparisons were made on the basis of the same 
pupils, taught by the same teachers, when classes were large and when 
they were small. 

All tests were reported in T scores derived from the groups tested 
in the experiment. As many large and small classes were tested during 
the first semester of the school year as during the second semester, so 
that neither large nor small classes would have the benefit of rapid in- 
itial improvement. 

Advantages of large or small classes were reported as “percent that 
difference is of lowest gain” (during one semester). The results are 
summarized herewith in tabular form: 


Subject Size Percent 
Advantage 
Second Grade 
PATILDIME TICLE retssahatsvevers|s [elarelessloreistefove MPmais aicelerersves SM GH oe 8.69 
Mam cua ete. crocs ste. cheseitrererete selere Ait a eee Ai Pangea 19.96 
IREAMING Speedy tare etersin wale cise eicloeTa es mielelele meee WAYS scr. 19.06 
Selim reer cic ere leleteterekeraveyero anes sears neterareeieree Smaller 1.81 
PAIIESUDJCCUS she whens: cierctetithevtkere caoteteterersieloneiniot aie eters Large...... 2.53 
Fifth Grade 
1 Bien OER EN GORE rea BSE RES MOT CRC a Wiarge Sarees 10.97 
Rea Gin espera rman om cicichla eye aeleramlen ere core ares War ges crores 249.78 
EVGading COMmpnrencnsion ecw helt siete tiemietoeieieice IETF ado 118.26 
Ait meticypnoplemaye mits ce ssriclelsisr eee eee eke Tharger ease 1.24 
ANALY SiS# (DLOOIEIIS) km erie sincere cstellstr teen a poral eeareraiee 3.24 
SSroel Lins oes Mien tebe iante, cretcts teak, syvtele sani oversvaeraear, rea M ls tera 25.22 
Alli Subjects mugariscthctis @ ceeraiisechelt os aeetncte ates Margelen wae 22.69 


The author finds that “diminishing returns” begin when the class 
size reaches about 45 pupils. 

The enormous saving that could be effected in large cities if class 
size could be increased by only one or two pupils is pointed out. 


227. Thorndike, E. L., and Kruse, P. J. ‘‘The effect of hu- 
midification of a school room upon the intellectual progress of 
pupils.’’ School and Soc., 5: 1917, 657-660. 


Two divisions of a sixth-grade of 43 pupils, the two being of equal 
initial ability, were so far as possible treated alike in every respect save 
the dryness of the schoo! room. The experiment extended from De- 
cember 4 to March 30, during which time the classroom of the “Dry” 
group received no humidification and averaged about 28.7 percent rela- 
tive humidity, while that of the “Moist” group was kept at about 42.2 
percent relative humidity. 
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On a battery of standard achievement tests given both at the begin- 
ning and end of the experiment, the superiority of one group to another 
was “no greater than its probable error. . . . What slight difference 
there is is in favor of the dry condition.” 


14. Effect of Emotional or Volitional Traits 


As in the ease of the corresponding section in the division on 
intelligence, few experimental data are available. Here appears 
to be a most promising field for future research. 


228. Downey, June. ‘‘Testing the Will-Temperament Tests.’’ 
School and Soc., 16: 1922, 161-168. 


Quoting a footnote, p. 163: “For a group of some seventy fresh- 
men we find a correlation of the Thorndike score and average grade 
for two quarters of .40. A combined score on the Thorndike and total 
for Group Will-Temperament Test raised the correlation to .65.” 


229. Gilchrist, E. P. ‘‘The extent to which praise and reproof 
affect a pupil’s work.’’ School and Soc., 4: 1916, 872-874. 


The Courtis English Test 4B was given to 50 members of a class 
in educational psychology. The class was then divided into two groups 
of 25, placed in separate rooms, and the test repeated. One group was 
told that it did not do so well as the average 12-year-old; the other 
group was told it had done exceptionally well. The first group had 
scores totaling 59.80 and 59.21 on first and second test. The second 
group had 72.42 and 129.50, or an improvement of 79 percent. 


Comment: The measure of improvement by percent is ambiguous. 


230. Heck, W. H. ‘‘Correlation between amounts of home 
study and class marks.’’ School Rev., 25: 1916, 1-17. 


Twenty-seven groups (divided by class, sex, number of units taken, 
etc., and therefore containing some of the same cases more than once) 
comprising over one thousand pupils in four Virginia high schools, 
showed correlations between number of hours of study during a repre- 
sentative week and average of school marks for the month in which 
that week occurred, of from —.11 + .08 to .37+.05. The average of 
all the r’s (for which no group has less than 57 students) is .o84. There 
is enough variability in the hours of study and in the marks to make 
this zero relationship significant. ‘The average minutes of study per 
week vary from about 350 to 750 with S. D.’s from 175 to 350 (approxi- 
mately) in the different groups. The average marks vary from 81 to 87 
with S. D.’s from 6 to 9 (approximately). Intelligence seems to over- 
lay the effect of duration of study. 
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231. *Hurlock, Elizabeth B. ‘‘An evaluation of certain incen- 
tives used in school work.’’ Jour. Educ. Psych., 16: 1925, 145-159. 


The subjects were 58 fourth-grade and 48 sixth-grade pupils in a 
Harrisburg school. They were evenly divided into four groups called 
the “control,” “praised,” “reproved,” and “ignored” groups. The same 
sex distribution obtained in each group, and the same initial ability 
(within .03 point on the author’s modification of the Courtis Research 
Test, addition form). 

Fifteen minutes a day for five days were spent on similar tests, a 
different one each day, consisting of 30 examples of equal difficulty made 
up of six three-place numbers. The “control” group, after the first 
day, was separated from the others and took the remaining tests in an- 
other room. The “praised” group was called in front of the class for 
praise, the “reproved” group was each day reproved before the class 
for its performance, and the “ignored” group was simply ignored. 

Initial scores for the four groups clustered about 11.8. S.D.’s for 
the groups varied from 4.2 to 7.7 on the various days. 


CoMPARISON OF FinAL TESTS 


Difference P.E. Diff.+P.E. 
Difference | Difference 


[RrAISeU=CONLRO Mma ear cite iacietan eaeeie aint: 


8.9 ily. 7.8 
Reproved=cOnerolue rere stein aelelalererteictece 2.8 1.0 PA fs! 
Tgenored=concroleenie eet ect aee 1.0+ 1.0-— bal 


232. Poffenberger, A. T., and Carpenter, F. L. ‘‘Character 
traits in school success.’’ Jour. Exper. Psych., 7:1924, 67-74. 

In a group of 97 elementary-school pupils, those who ranked higher 
on the National Intelligence Test than on school success showed a 
tendency to excel on certain elements of the Downey Will-Temperament 
Test. 

Comment: The results are reported in such a way that the strength 
of the tendency is impossible to evaluate. Neither group dispersions 
of the Will-Temperament scores nor correlations of Will-Temperament 
with success are given. 


233. May, Mark A. ‘‘Predicting academic success.’’ Jour. 
Educ. Psych., 14: 1923, 429-440. 


With a group of 450 Liberal Arts College freshmen a combinatien 
of the Miller Test and Dartmouth Definitions correlated .60 with the 
number of honor points at the end of the first semester. 
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Honor points correlated with 


Number of H. S. units offered at entrance........ 22 
High-school marks or entrance examination...... .40 
Fioufsot week Spentvin Study. ...6...0. 4 sae ev ves 32 
Multipletof four, factors aboves,.c..60.s.+eee sees 84 


Leaving out hours of study, which correlated —.35 with intelligence, 
the multiple R was .64. Intelligence with hours of study constant corre- 
lated .80; with high-school average constant, .53; and with units con- 
stant, .59. 

The conclusion is that a combination of intelligence and hours of 
study is the most reliable means of prediction. Influence of other 
factors is considered proportional to their partials with honor points 
when the remaining factors are partialled out. 


Comment: ‘The conclusion seems sound in the main, but the inter- 
pretations of the partial correlations are subject to some of the difficul- 
ties pointed out in Part I, Chapter II on “Statistical Hazards.” 


234. Gates, A. I. ‘‘Nature and educational significance of 
physical status and of mental, physiological, social, and emotional 
maturity.’ Jour. Educ. Psych., 15: 1924, 329-358. 

In the Horace Mann School 57 fourth-grade pupils were tested by 
the Stanford Binet and a battery of achievement tests, and rated on 


emotional maturity and social maturity by five judges. The reliability 
of the combined ratings was about .87. Correlations found were: 


MEP AMEW ICME SOCIAl gInAtlITILY ciselstelste c olsiaveneieielslslershars .20 
ME Atertoeeriotiotal, maturity... ts ove ase oe .16 
HA ewithrsocial Maturity a. ds. c0 ean «bes Re ante as .19 
Pee witheemotional maturity. sb. «tae es ok coe .20 


Comment: What slight correlations actually obtain here between 
qualities of personality and achievement may be due to causative inter- 
dependence or to “halo-effects” in the teachers’ ratings. 


15. Effect of Segregation According to Ability 


The lack of very pronounced effects suggests that the advantages 
of segregation may be more in the development of habits of work, 
ideals, and interests than in the mere advancement of measurable 
school abilities. 


235. Burtt, H. E., Chassell, L. M., and Hatch, E. M. ‘‘Effi- 
ciency of instruction in unselected sections in elementary psy- 
chology compared with that in sections selected on basis of in- 
telligence tests.’? Jour. Educ. Psych., 14: 1923, 154-161. 
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Three experimental classes having from 35 to 60 students were 
selected for high, medium, and low intelligence, respectively; six con- 
trol classes taught by the same instructors were not sectioned for ability. 
The experimental classes had average percentile scores on a modified 
Army Alpha test of 83, 54, and 21 the first semester, and 84, 54, and 22 
the second semester (based on norms for 2,000 freshmen). 

During the first semester the rate of progress in all sections was 
kept normal, but in the second semester the top and medium sections 
and their two controls were ‘pushed,’ upon the principle that the cor- 
relation between intelligence and course marks would measure efficiency 
of instruction. Product-moment r’s were computed and corrected for 
range by Kelley’s formula. In both the high and medium sections, the 
correlations between intelligence and marks rose during the second 
semester, but they did not in the control groups. The conclusion was 
that segregation was not an advantage unless methods were then used 
especially adapted to ability. 


236. Cobb, Margaret V., and Taylor, Grace A. ‘‘Stanford 
Achievement Tests with a group of gifted children.’? Twenty- 
Third Yearbook of this Society. Part I, 1924, 275-289. 


The average accomplishment ratio on the Stanford Achievement 
Test of 52 children with I.Q.’s of 135 to 190 in “opportunity classes” 
at Manhattan P. S. 165 was 97 after one year of segregation. This 
accomplishment ratio is higher than that usually found for unselected 
gifted children. 


237. Cook, R. R. ‘‘A study of the results of homogeneous 
grouping of abilities in high-school classes.’? Twenty-Third Year- 
book of this Society. Part I, 1924, 302-312. 


Pupils in the Topeka High School were divided, on the basis of 
previous term’s marks in similar subjects or on the basis of Terman 
Group Test scores, into classes containing approximately the best half 
and poorest half of students. Mixed classes were also used for com- 
parison. First-term freshman English, first-term sophomore English, 
plane geometry, and ancient history were dealt with. Two sets of high, 
low, and mixed classes were included for the different subjects—each 
set taught by a different teacher and the high and low sets “equated” 
for time of day. No attempt was made to adapt the subject matter to 
the types of classes taught. 

Scores of averages are not reported. Neither are the numbers of 
pupils in each type of class. These facts would make it seem possible 
to accept only provisionally the author’s conclusions, which are: 

“Pupils of superior ability in geometry did better work in mixed 
classes than they did in selected groups. Their history scores were 
much higher than scores of pupils of the same ability in mixed classes. 
In English the differences were so small as to be negligible.” 
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“Neither can we say that separate grouping is always beneficial to 
slow pupils. In this experiment slow pupils in separate groups obtained 
much better results in geometry, slightly better in English; but in 
history they were much retarded by being grouped separately.” These 
conclusions are based upon tests devised by the teachers and admin- 
istered to all the classes three or four times during the semester. 


238. Laird, D. A. ‘‘A study of the influence of sectioning stu- 
dents upon their achievement.’’ Jour. Educ. Psych., 14: 1923, 
143-153. 


Sections in elementary psychology (1) between 1909 and 1915, and 
(2) between 1915 and 1921 (omitting 1919-20), involving 2700 students, 
were studied. 

During the first period the sections met three times a week for 
lectures or experimental work. During the second period they met 
twice a week as a class and one hour in smaller conferences. Marks 
were from A to F, with C average. During both periods there was a 
nine o’clock section and an eleven o’clock section (whether taught by 
the same instructor is not stated). 

Eleven o’clock sections during the first period had a smaller pro- 
portion of good and poor students. The first semester marks averaged 
2.7 percent A’s and 0.9 percent F’s; the second semester marks aver- 
aged 3.4 percent A’s and 0.9 percent F’s. Corresponding figures for 
the nine o’clock section were 5.2 and 1.8; 6.2 and o.4. Average grades 
were apparently about the same (but not reported) for the nine and 
eleven o’clock sections. No significant differences in grades were found 
during the second period (1915-1921), and differences during the first 
period are less during the second semester. For the first semester of 
1921-22, Thorndike scores correlated with grades .21 for the nine o’clock 
section and .32 for the eleven o’clock section. 


239. Moyer, E. L. ‘‘A study of the effects of classification by 
intelligence tests.’’ Twenty-Third Yearbook of this Society, Part 
I, 1924, 318-322. 


Freshmen pupils were segregated for algebra and Latin classes into 
three sections on the basis of two forms of the National Intelligence 
Test. Owing to conflicts in programs, not all could go into these 
segregated classes, which resulted in “mixed” classes for comparison. 
No attempt was made to differentiate content of instruction. 

The reliability or exact significance of the reported differences is 
unknown, because numbers and dispersions in scores are not reported. 
The author draws the following conclusions: 

(1) In algebra tests (Hotz Algebra Scale) the superior segregated 
pupils are about equal to the superior pupils in mixed classes, but in 
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Latin tests (Henmon Tests) an advantage appears in favor of supe- 
rior pupils in segregated classes. 

(2) In both subjects the medium pupils in superior or in medium 
segregated classes excelled those in mixed classes. 

(3) In algebra the medium pupils in low segregated classes were 
lower than those in mixed sections. 


(4) In algebra the poor pupils, both in low and mixed classes, made 
about the same scores. 


Comment: The conclusions seem to have qualitative validity, even 
if they have quantitative ambiguity. 
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to each February mecting, will be mailed from the office of the publishers, 
only to members whose dues for that year have been paid. Members who 
desire yearbooks prior to the current year must purchase them directly from 
the publishers (see Item 8). 

8. Commercial Sales. The distribution of all yearbooks prior to the 
current year, and also those of the current year not regularly mailed to 
members in exchange for thcir dues, is in the hands of the publishers, not 
of the secretary. For such commercial sales, communicate directly with 
the Publie School Publishing Company, Bloomington, Illinois, who will gladly 
send a price list covering all the publications of this Society and of its 
predecessor, the National Herbart Society. Bound sets can be supplied of 
the first twenty-six Yearbooks. 

9. Yearbooks. The yearbooks are issued in parts (usually two) from 
one to four months before the February meeting. They comprise from 
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500 to 800 pages annually. Unusual effort has been made to make them, 
on the one hand, of immediate practical value, and on the other hand, 
representative of sound scholarship and scientific investigation. Many of 
them are the fruit of codperative work by committees of the Society. 

10. Meetings. The annual meetings, at which the yearbooks are dis- 
cussed, are held in February at the same time and place as the meeting of 
the Department of Superintendence of the National Education Association. 

Applications for membership will be handled promptly at any time on 
receipt of name and address, together with check for the appropriate amount 
($3.00 for new active membership, $2.50 for new associate membership). 
New members will receive, about three weeks after their application is 
mailed, the yearbook slated for discussion at the February meeting of the 
year in which the application is made (except in the case of December 
applications, as explained in Item 3). 


GUY M. WHIPPLE, Secretary-Treasurer. 


10 Putnam Street, 
Danvers, Mass. 
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